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ABSTRACT 


Wald's  theory  is  used  to  find  truncated  sequential  test 
regions  for  the  hypergeometric  distribution.  These  regions 
are  then  evaluated  using  Aroian's  direct  method  of  sequential 
analysis.  Using  this  method,  the  important  test  properties 
(operating  characteristic  (OC)  function,  average  sample  num- 
ber (ASN)  function  and  the  distribution  of  the  decisive  sample 
number  (DSN))  are  found  exactly.  The  tests  are  compared  with 
other  similar  tests  (both  sequential  and  fixed  size)  and 
estimation  of  the  parameter  after  completion  of  the  sequential 
test  is  treated.  Numerical  examples  and  general  computer 
programs  are  also  included. 


TABLE  OF  CONTENTS 


ILLUSTRATIONS  v 

TABLES  vii 

TEST  PLANS  vixi 

CHAPTER  PAGE 

INTRODUCTION  1 

1 BACKGROUND  AND  REVIEW  OF  THE  LITERATURE  2 

1.0  Introduction  2 

1.1  The  Hypergeometric  Distribution  2 

1.2  When  the  Hypergeometric  Distribution 

Should  Be  Used  4 

1.3  Fixed  Size  Sample  Tests  for  the 

Hypergeometric  Distribution  6 

1.4  Extension  to  Three  Decision  Tests  10 

1.5  Previous  Results  with  Sequential 
Tests  of  the  Hypergeometric  ■ 

Distribution  14 

2 CONSTRUCTION  OF  THE  SEQUENTIAL  TEST  REGIONS 
FOR  TWO  AND  THREE  DECISION  TEST  PROCEDURES 

FOR  THE  HYPERGEOMETRIC  DISTRIBUTION  16 

2.0  Introduction  16 

2.1  Construction  of  the  Boundaries  for  a 

Two  Decision  Sequential  Test  16 

2.2  Construction  of  the  Boundaries  for  a 

Three  Decision  Sequential  Test  25 

2.3  Tests  of  Composite  Hypotheses  and 

the  OC  Function  38 

3 EVALUATION  OF  THE  TEST  REGIONS  USING  THE 

DIRECT  METHOD  OF  SEQUENTIAL  ANALYSIS  45 

3.0  Introduction  45 

3.1  The  Direct  Method  of  Sequential 

Analysis  45 

iii 


CHAPTER 


PAGE 


2.2  Truncation  of  the  Sequential 

Test  Region  49 

3.3  Obtaining  the  Test  Properties 

of  a Two  Decision  Test  Region  57 

3.4  Obtaining  the  Test  Properties 

of  a Three  Decision  Test  Region  66 

4 FURTHER  NUMERICAL  EXAMPLES  73 

4.0  Introduction  73 

4.1  Test  Plan  Examples  73 

4.2  Using  tne  Sequential  Test  Plans  74 

5 COMPARISON  WITH  OTHER  TESTS  97 

5.6  Introduction  97 

5.1  Comparison  with  Fixed  Size  Sample 

Tests  97 

5.2  Comparison  with  Sequential  Tests 

of  the  Binomial  Distribution  100 

6 ESTIMATION  OF  THE  NUMBER  OP  DEFECTIVES 

AFTER  TERMINATION  OF  THE  SEQUENTIAL  TEST  109 

6.0  Introduction  109 

6.1  History  of  Sequential  Estimation  110 

6.2  The  General  Method  of  Schmee  and  Goss  111 

6.3  Interpretation  of  the  Posterior 

Distribution  115 

6.4  Estimation  with  the  Hypergeometric 

Distribution  117 

6.5  A Numerical  Example  120 

REFERENCES  123 

APPENDIX  COMPUTER  PROGRAMS  USED  TO  DEVELOP  AND  127 

EVALUATE  THE  TEST  PT  ANS 


f] ^ 


I 


I ILLUSTRATIONS 


FIGURE 

PAGE 

1.1 

Error  Probabilities  for  a Two  Decision  Test 

6 

1.2 

Typical  OC  Curve  for  a Two  Decision  Test 

8 

1.3 

Error  Probabilites  for  a Three  Decision  Test 

12 

1.4 

Typical  OC  Functions  for  a Three  Decision  Test 

13 

2.1 

Typical  Two  Decision  Test  Region  for  the 
Hypcrgeometric  Distribution 

22 

2.2 

Sequential  Test  Region  for  the  Two  Decision 
Example 

26 

2.3 

A Typical  Three  Decision  Test  Region  for 
Hypergeometric  Distribution 

31 

2.4 

Sequential  Test  Region  for  the  Three  Decision 
Example 

34 

2.5 

A Three  Decision  Sequential  Test  Region 

36 

2.6 

Illustration  of  the  Independence  of  the 
Sobel-Wald  Sequential  Test  Region 

36 

2.7 

Armitage's  Sequential  Test  Region 

39 

2.8 

Sobel-Wald  Region  when  0ri+Qr2>^l+^2 

39 

2.9 

Simple  and  Composite  Hypotheses 

40 

3.1 

Typical  ASN  Function 

47 

3.2 

Typical  OC  Function 

47 

3.3 

Typical  Distribution  of  the  DSN 

47 

3.4 

Wedge  and  Right  Angle  Truncation 

52 

3.5a 

Improper  Truncation 

54 

3.5b 

Proper  Truncation 

54 

3.6 

Probability  Grid  for  a Two  Decision 
Sequential  Test 

59 

3.7 

Graph  of  the  ASN  Function  for  the  Two  Decision 
Example 

64 

v 


FIGURE 

PAGE 

3.8 

Graph  of  the  OC 
Decision  Example 

Function 

for  the  Two 

65 

3.9 

Probability  Grid 
Example 

for  the 

Three  Decision 

70 

3.10 

Graph  of  the  ASN 
Decision  Example 

Function  for  the  Three 

71 

3.11 

Graph  of  the  OC 
Decision  Example 

Function 

for  the  Three 

72 

4.1 

Test  Properties 

for  Test 

Plan  1 

76 

4.2 

Test  Properties 

for  Test 

Plan  2 

78 

4.3 

Test  Properties 

for  Test 

Plan  3 

80 

4.4 

Test  Properties 

for  Test 

Plan  4 

82 

4.5 

Test  Properties 

for  Test 

Plan  5 

84 

4.6 

Test  Properties 

for  Test 

Plan  6 

86 

4.7 

Test  Properties 

for  Test 

Plan  7 

88 

4.8 

Test  Properties 

for  Test 

Plan  8 

90 

4.9 

Test  Properties 

for  Test 

Plan  9 

92 

4.10 

Test  Properties 

for  Test 

Plan  10 

94 

4.11 

Graphical  View  of  the  Sequential  Test 

96 

5.  1 

Typical  Wald  Region  for  the  Binomial 
Distribution 

103 

TABLES 


1 


TABLES 


PAGE 


Log  Likelihood  Ratio  for  Different  Values 
of  x at  Trial  32 

Critical  Values  for  the  Two  Decision  Example 

Log  Likelihood  Ratio  for  Different  Values 
of  x at  Trial  52 

Critical  Values  for  the  Three  Decision  Example 
Effects  of  Region  Modification 
Critical  Limits 

Distribution  of  the  DSN  for  the  Two  Decision 
Example 

Properties  of  the  Two  Decsion  Example 
Properties  of  the  Three  Decision  Example 
Typical  Sequential  Sample 

Comparison  of  the  Fixed  Size  and  Sequential 
Tesus 

Binomial  and  Hypergeometric  Test  Regions 
for  Test  Plan  #3 

Binomial  and  Hypergeometric  Test  Regions 
for  Test  Plan  #5 

Comparison  of  the  Sequential  Hypergeometric 
Test  and  the  Binomial  Approximation 

Estimates  and  Confidence  Limits 


vii 


TEST  PLANS 


TEST  PLAN 

N 

Do 

D1 

a 

0 

PAGE 

1 

30 

5 

15 

0.05 

0.1 

75 

2 

30 

10 

20 

0.05 

0.1 

77 

3 

50 

2 

12 

0.05 

0.1 

79 

4 

50 

10 

20 

0.05 

0.1 

81 

5 

50 

20 

30 

0.05 

0.1 

83 

6 

100 

5 

20 

0.05 

0.1 

85 

7 

100 

10 

25 

0.05 

0.1 

87 

8 

100 

15 

30 

0.05 

0.1 

89 

9 

100 

25 

40 

0.05 

0.1 

91 

10 

100 

40 

60 

0.05 

0.1 

93 

viii 


INTRODUCTION 


This  report  is  concerned  with  sequential  tests  of  the 
hypergeometric  distribution.  The  report  is  organized  as 
follows.  Chapter  1 reviews  the  relevant  literature  and  gives 
some  necessary  background  material.  Chapter  2 describes  the 
construction  of  the  sequential  test  regions  for  the  hyper- 
geometric distribution  and  Chapter  3 applies  the  direct  method 
of  sequential  analysis  to  the  problem,  allowing  exact  evalu- 
ation of  the  test  properties.  Both  two  and  three  decision 
test  procedures  are  treated.  Chapter  4 presents  a variety  of 
numerical  examples,  including  fixed  size  and  sequential  test 
plans  and  their  respective  properties.  In  Chapter  5,  the 
sequential  test  properties  are  compared  with  both  fixed  size 
tests  and  a binomial  approximation  to  the  hypergeometric  prob- 
lem. There,  the  superiority  of  the  sequential  procedures  (with 
respect  to  average  sample  size  requirements)  is  clearly  shown. 
Chapter  6 examines  a method  of  estimation  which  can  be  performed 
at  the  completion  of  a sequential  hypothesis  test.  Computer 
programs  developed  for  these  procedures  are  given  in  the 
Appendix. 


1 


CHAPTER  1 


BACKGROUND  AND  REVIEW  OF  THE  LITERATURE 

1 . 0 INTRODUCTION 

This  chapter  introduces  the  hypergeometric  distribution 
and  reviews  the  relevant  literature.  The  first  two  sections 
discuss  this  distribution  and  give  examples  of  its  proper 
use.  The  following  section  pie  ents  the  development  of  fixed 
size  sample  tests  for  this  distribution.  These  tests  will  be 
compared  with  the  se^ential  tests  developed  here.  The  last 
section  reviews  some  of  the  previous  results  with  sequential 
tests  of  the  hypergeometric  distribution. 

1.1  THE  HYPERGEOMETRIC  DISTRIBUTION 

The  hypergeometric  distribution  is  the  appropriate  dis- 
tribution when  each  element  of  a finite  population  can  be 
considered  either  a success  (defective)  or  a failure  (non- 
defective) and  sampling  is  performed  without  replacement  from 
a finite  population.  It  is  assumed  that  there  are  D defectives 
in  a population  of  size  N.  If  a random  sample  of  size  n is 
drawn  from  the  population,  the  probability  that  it  will  con- 
tain x defectives  is  given  by  the  hypergeometric  probability 
mass  function: 


2 


3 


h(x;N,n,L>)  = 


Pi (N-D) Ini (N-n)  i 

N!x! (D-x) i (n-x) I (N-D-n+x) ! 

max (0,n-N+D) <x<min (n,D) 


(1.1) 


The  cumulative  distribution  function,  which  gives  the  proba- 
bility that  the  number  of  defectives  x found  in  a sample  of 
size  n is  less  than  or  equal  to  r is 

r 

P(x<r)  = H (r;N,n,D)  = E h(i,N,n,D).  (1.2) 

i~0 

Lieberman  and  Owen  (1961)  give  tables  of  n(x,N,n,D)  and 
H (r ,N,n,D)  for  N=1  (1)  50  (10) 100. 

This  dissertation  considers  inferences  about  D,  the  number 
of  defectives  in  a finite  population  of  known  size  N.  Infer- 
ences concerning  other  parameters  (e.g.,  the  population  size  N) 
can  also  be  made  by  generalizing  the  procedures  presented  hero. 

The  hygergeometric  distribution  has  many  important  appli- 
cations, one  of  the  most  important  being  acceptance  sampling. 
Grant  (1964)  gives  some  discussion  of  this  along  with  a table 
of  log  factorials  which  are  useful  for  calculating  hypergeometric 
probabilities.  The  hypergeometric  distribution  can  also  be 
used  for  nonparametric  tests  of  location  and  dispersion,  as 
explained  in  Owen  (1962)  and  Gibbons  (1971).  In  addition,  it 
is  used  for  tests  of  significance  for  the  equality  of  two 
unknown  binomial  proportions  and  for  tests  of  independence  of 


i 

? 
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two  binary  characteristics.  These  last  applications  arise 
because  the  hypergeometric  is  the  null  distribution  for  tests 
which  can  be  formulated  in  terms  of  a 2x2  table  in  which  one 
is  testing  the  independence  of  the  rows  and  columns. 

Johnson  and  Kotz  (1969),  in  their  book,  give  some  fur- 
ther applications  of  the  hypergeometric  distribution  along 
with  a somewhat  more  theoretical  development  of  the  distribu- 
tion, including  the  moments  and  the  generating  functions. 

They  also  present  discussion  of  different  methods  of  fixed 
size  sample  estimation  and  of  some  extensions  of  the  dis- 
tribution. A rather  complete  set  of  references  relating  to 
the  hyper geometric  distribution  is  also  included  there. 

1.2  WHEN  THE  HYPERGEOMETRIC  DISTRIBUTION  SHOULD  PROPERLY 

BE  USED 


1 

\ 

J 


i 


There  is  often  confusion  as  to  when  the  hypergeometric 
distribution  should  be  used  and  when  the  binomial  distribu- 
tion should  be  used.  As  mentioned  earlier,  the  hypergeometric 
distribution  should  be  used  when  sampling  without  replacement 
from  a finite  population.  If  the  population  being  sampled  is 
infinite  (or  is  large  enough  to  be  considered  so)  or  if 
sampling  is  done  with  replacement,  the  binomial  distribution 
should  be  used. 


The  important  distinction  to  be  made  between  the  binomial 
situation  and  the  hypergeometric  situation  is  in  the  probability 
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of  finding  a defective  at  each  inspection.  An  example  makes 
this  clearer.  Consider  a quality  control  engineer  who  wishes 
to  make  inferences  about  the  number  of  defectives  in  a lot 
of  100  items.  First  the  assumption  is  made  that  the  machine 
which  produces  the  items  produces  defects  with  some  constant 
probability.  If  20  items  are  to  be  inspected  and  they  are 
inspected  as  they  come  from  the  machine  (not  necessarily  con- 
secutive items) , the  binomial  distribution  should  be  used 
because  the  probability  that  a given  item  will  be  defective 
is  the  same  for  each  of  the  20  items  tested.  On  the  other 
hand,  if  after  the  lot  of  100  items  has  been  produced  , a random 
sample  of  20  items  is  selected  from  this  lot  and  inspected 
without  replacement,  the  hypergeometric  distribution  is 
appropriate.  This  is  because  the  probability  of  observing  a 
defective  is  no  longer  constant  for  each  of  the  items  inspected. 

In  general,  if  the  population  is  finite,  tests  performed 
without  replacement  using  the  hypergeometric  distribution  will 
be  more  powerful.  The  size  of  the  increase  in  power  is  re- 
lated to  the  ratio  of  the  sample  size  to  the  population  size 
(along  with  the  other  parameters) . If  this  ratio  is  less  than 
0.1,  the  difference  between  the  tests  will  in  most  cases  be 
small  enough  to  ignore.  Also,  if  the  ratio  is  small  enough, 
either  distribution  may  be  used  when  sampling  without  replace- 
ment. This  is  of  course  due  to  the  fact  that  the  hypergeometric 
distribution  will  approach  the  binomial  distribution  if  N is 


increased  and  D/N  remains  constant.  If,  however,  these  above 
conditions  are  not  met,  one  should  use  the  hypergeometric 
distribution. 

1.3  FIXED  SIZE  SAMPLE  TESTS  FOR  THE  HYPERGEOMETRIC  DISTRIBUTION 

By  using  the  tables  of  the  hypergeometric  distribution, 
one  can  find  the  fixed  size  sample  test  which  has  the  smallest 
necessary  sample  size  and  still  has  error  probabilities  which 
meet  the  desired  specifications.  A procedure  for  doing  this 
is  presented  below. 

In  a two  decision  test  there  are  two  types  of  errors  to 
be  considered.  These  are  shown  in  Figure  1.1. 

Decision  Based  on  Test  Results 


Ho 

H 

a 

True  State 

Hn 

NO  ERROR 

a ERROR 

of 

0 

Nature 

H 

a 

8 ERROR 

NO  ERROR 

Figure  1.1  Error  Probabilities  for  a Two  Decision  Test 

The  first  is  called  a Type  I or  a error  and  is  made  when 
there  is  a decision  to  accept  the  alternate  hypothesis,  H , 

cl 

when  in  fact  the  null  hypothesis,  HQ,  is  true;  a usually  denotes 
the  probability  of  such  an  error.  A Type  II  or  8 error  occurs 
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when  the  null  hypothesis  is  accepted  when  in  fact  the  alternate 
hypothesis  is  true;  B usually  denotes  the  probability  of  a Typo 
II  error.  The  following  notation,  however,  is  used  here.  Let 
a and  S denote  the  desired  probabilities  of  the  Type  I and  Type 
II  errors  respectively  and  let  a*  and  B*  denote  the  probabilities 
actually  given  by  the  fixed  size  sample  test. 

Guenther  (1969)  describes  in  detail  a method  of  obtaining 
the  desired  test  by  using  the  tables  of  the  hypergeometric  dis- 
tribution provided,  for  example,  by  Lieberman  and  Owen  (1961). 

The  method  is  straightforward  and  is  outlined  below. 

The  simple  hypotheses  to  be  tested  are 


Hq ; D=D<N/2 

versus  : D=D1>DQ 

The  requirements  of  the  desired  test  are: 
P (accept 


(1.3) 


H0)=Jl-a  if  D=D0 

(B  if  D=D1 


(1.4) 


This  expression  gives  two  points  of  the  operating  characteristic 
(OC)  function.  The  OC  function  gives  the  probability  of  accepting 
Hq  as  a function  of  the  true  state  of  nature.  Because  it  is  re- 
quired that  the  OC  function  be  nonincreasing,  it  is  clear  that 
a+B<l.  An  example  of  a typical  OC  function  is  shown  in  Figure  1.2. 
Actually,  the  OC  function  is  a step  function  in  this  case,  although 
it  is  shown  here  as  if  it  were  continuous.  This  practice  will  be 
used  throughout  to  make  the  graphs  easier  uo  read. 


Figure  1.2  Typical  OC  Curve  for  a Two  Decision  Test 

The  fixed  size  sample  test  plan  is  defined  by  the  sample 
size  n*  and  a critical  value  c*.  If  the  number  of  defectives 
found  in  the  sample  of  size  n*  is  greater  than  c* , the  alternate 
hypothesis,  , is  accepteu;  otherwise,  the  null  hypothesis, 

Hq,  is  accepted.  The  procedure  for  finding  such  a test  if  the 
appropriate  values  of  the  probability  function  are  available 
(from  tables,  for  example)  is  given  by  Guenther  (1969)  and  is 
repeated  here  for  completeness. 

1.  Start  with  the  critical  value  c*=0 

2.  Find  the  largest  n,  say  n , such  that 
H(c*;N,n  ,D  ) kl-a.  This  inequality  will 

xj  U 

hold  for  all  nsnL 

3.  Find  the  smallest  n,  say  ng , such  that 
H(c*;N,ng,D^) sfl.  This  inequality  will  be 
satisfied  for  all  n^n„ 
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4.  If  n <n  , the  plan  (n„,c*)  satisfies  the 

O Jj 

requirements  with  minimum  sample  size  and 
ng  is  n*. 

5.  If  n >n  / increment  c*  by  one  and  go  to 

S L# 

step  3. 

After  n*  and  c*  have  been  determined,  the  values  of  the  error 
probabilities  can  be  determined  as  follows: 


c* 

a*  = 1-  Z h(x;N,n*,D  ) 
x=0 

= l-H(c*?N,n*,D0) 
c* 

B*  = Z h (x;N,n* ,D. ) (1.5) 

x=0 

= H(c*;x,N,n*,D1) 
r 

where  H(r;N,n,D)=  Z h(x;K,n,D),  the  cumulative  distribution 

x=0 

function,  is  given  by  Liebernan  and  Owen  (1961)  for  N=1 (1)50(10) 100- 
For  values  of  N not  contained  in  the  table,  Guenther  (1973) 
suggests  another  approach  for  obtaining  the  desired  test.  This 
approach  is  improved  here.  Using  an  approximation  due  to  Wise 
(1954) , 


where 


(1.6) 


(1.7) 


Letting  P=l-H (c;N,n,D) , it  becomes  necessary  to  find  a solution 
for  (1.6)  in  terms  of  n given  N,D,c  and  P.  Guenther  (1971) 
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gives  the  approximation 

n«.  5{x2c+2 ; 1-P  ( 1/71-0  • 5 } +c}  d-8) 

to  be  used  with  the  binomial  distribution,  and  solves  the  system 
of  (1.7)  and  (1.8)  iteratively,  failing  to  realize  that  one  can 
solve  for  the  approximate  n directly.  It  can  be  shown  that 


n-F (N,P,c,D) 

where  F (N,P,c,D)  = |x2 (2N+1-Df 5c) +c (D-c)f / (4D-2c+x2) 


(1.9) 


O i.u 

and  Xz  is  the  P1"  percentile  of  the  chi-squared  distribution 
with  2c+2  degrees  of  freedom.  Further,  an  approximation  of  the 
desired  test  can  be  found  by  increasing  c until  the  interval  n 


F(N,P,D1,c)<n<F(N,P,D(),c)  (1.10) 

contains  at  least  one  integer.  A first  guess  for  the  test  is 
then  obtained  by  using  the  smallest  integer  in  the  interval  for 
the  sample  size  n*  and  c for  the  critical  value  c*. 

The  computer  program  given  in  the  Appendix  carries  out  the 
above  calculations  to  find  an  initial  guess  for  c*,  and  then 
uses  the  first  procedure  cf  Guenther  to  find  the  test  size  n* 
and  the  critical  value  c*  exactly. 

The  fixed  size  sample  tests  used  for  comparisons  in  this 
dissertation  are  not  randomized.  Randomization,  if  used,  can 
serve  two  purposes.  When  developing  tests  of  a discrete  dis- 
tribution, such  as  hypergeometric  distribution  considered  here, 
the  size  of  the  error  probabilities  are  often  considerably  less 
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than  the  desired  size.  Randomization  can  be  introduced  into 
the  test  such  that  the  probability  of  a Type  I (or  Type  II) 
error  can  achieve  exactly  the  desired  size,  i.e.,  a=a* 

(or  $=&*).  This  is  valuable  if  two  alternate  tests  are  to  be 
compared.  Also,  the  use  of  this  technique  may  allow  for  a 
small  reduction  in  the  required  sample  size. 

Randomization  is  accomplished,  for  example,  by  accepting 
1IQ  with  probability  y if  c*  defectives  are  found  in  the  sample, 

H.  being  accepted  with  a probability  (1-y).  y is  chosen  such 
that  the  probability  of  a Type  I (or  Type  II)  error  is  exactly 
the  desired  size.  For  simplicity  of  presentation,  randomized 
tests  are  not  used  here.  Most  of  the  comparisons  will  not  be 
affected  by  this  omission. 

I. 4  EXTENSION  TO  THREE  DECISION  TESTS 

The  extension  of  the  above  results  to  two-sided  or  three 
* 

decision  tests  is  straightforward.  The  hypotheses  are  stated 
as  follows: 

: D=D1<N/2 

versus  HQ:  D-DQ>D1 

versus  D=D2>DQ 

In  this  case,  there  are  four  types  of  errors  which  can  be  made. 

is  the  probability  of  accepting  when  is  true  and  0^  is 
the  probability  of  accepting  or  H2  when  is  true,  is  the 

* 

The  three  decision  test  is  a generalization  of  the  standard  two- 
sided  test;  that  is,  separate  a and  S errors  can  be  specified 
for  each  alternate  hypothesis  (see  Goss  (1974b) 
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probability  of  accepting  when  HQ  is  true  and  $2  is  the 
probability  of  accepting  or  Hq  when  H2  is  true.  These 
error  probabilities  are  shown  in  Figure  1.3. 


Decision  Based  on  Test  Results 


H 


True  State  H 

of 

Nature 

H 


Figure  1.3  Error  Probabilities  for  a Three  Decision  Test 


H0  K2 


NO  ERROR 

ERROR 

ERROR 

NO  ERROR  a2  ERROR 

ERROR  , NO  ERROR 

1 

The  fixed  size  sample  test  is  found  as  in  the  two  decision 
test,  except  that  there  are  now  four  constraints  to  be  considered: 


P (accept 

Hl)  = 

^l 

if 

D-Dj. 

P (accept 

Hl)  = 

al 

if 

D=D0 

P (accept 

h2)  = 

!-B2 

if 

d=d2 

P (accept 

V 

a2 

if 

D=D0 

This  gives  two  points  on  each  of  two  of  the  OC  curves  for  the 
three  decision  test.  Graphs  showing  typical  OC  functions  for  a 
three  decision  test  are  shown  in  Figure  1.4. 


True  Number  of  Defectives  D 


Figure  1.4  Typical  OC  Functions  for  a Three  Decision  Test 

The  solid  curve,  decreasing  with  D,  gives  the  probability 
of  accepting  H1  as  a function  of  the  true  state  of  nature,  D. 

The  solid  curve  increasing  with  D gives  the  probability  of 
accepting  H2.  The  dotted  curve  gives  the  probability  of  accept- 
ing Hq  for  different  values  of  D.  The  sum  of  these  three  curves 
is  of  course  1.  Again,  these  curves  are  really  step  functions 
because  of  the  discrete  nature  of  the  parameter  D,  but  are  shown 
here  as  being  continuous. 

The  test  plan  consists  of  the  sample  size  n*  and  two  critical 
values  c*  and  c*.  The  decision  rules  for  the  test  are  then: 

Li  U 

accept  H..  if  x <c* 

JL  Xj 

accept  Hq  if  c*<x<c*  (1.13) 

accept  H2  if  x >c* 

This  test  may  also  be  randomized  by  specifying  probabilities  y^ 
and  y2  for  the  probabilities  of  accepting  Hq  or  H2  when  D is 
equal  to  c*  or  c*  respectively. 
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1.5  PREVIOUS  RESULTS  WITH  SEQUENTIAL  TESTS  OF  THE  HYPERGEOMETRIC 

DISTRIBUTION 

The  first  suggestion  of  a sequential  test  of  the  hyper- 
geometric  distribution  is  found  in  Wald  11947)  where  it  is  gi/en 
as  an  example  of  a simple  case  of  dependent  observations.  Heie 
Wald  shows  how  to  perform  the  sequential  test  by  using  a sequen- 
tial probability  ratio  test  (SPRT) . Chung  (1950) , Dumas  (1969) > 
Dumas  (1970)  and  Ghosh  (1970)  also  discuss  the  SPRT  for  this 
distribution.  Similar  tests,  with  some  modification,  are  used 
here  as  explained  in  Chapter  2. 

Chung  (1950)  derives  approximations  for  the  likelihood  ratio 
used  in  sequential  tests  of  the  hypergeometric  distribution. 

These  approximations  are  valid  when  the  proportion  of  defectives 
in  the  population  is  small.  He  also  gives  some  examples  of  their 
use  in  acceptance  sampling  plans.  He  does  not  discuss  the  result 
ing  test  properties.  Yang  (1968)  gives  rules  for  a sequential 
test  of  the  hypergeometric  distribution,  but  does  not  mention  the 
properties  of  these  tests. 

Dumas,  in  his  papers,  gives  approximation  for  the  boundaries 
of  the  sequential  test  region.  These  approximations  are  based 
on  the  use  of  Stirling's  approximation  for  the  factorials  which 
appear  in  the  equations.  He  also  discusses  the  resulting  shape 
of  the  regions  and  their  geometric  relation  to  the  regions  used 
in  sequential  tests  of  the  binomial  distribution.  Some  further 
comparisons  between  the  tests  of  these  two  distributions  are 
given  in  Chapter  4, 
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Ghosh  (1970)  notes  that  because  the  observations  from  a 
|*  hypergeometric  distribution  are  not  independent,  the  OC  function 

and  other  test  characteristics  will  be  "quite  difficult  to 
determine."  This  is  because  the  approximations  given  by  Wald 
(1947)  hold  only  when  the  observations  are  independently  and 
identically  distributed  (i.i.d.).  Ghosh  (1970)  gives  conjectural 
formulas  for  the  OC  and  the  average  sample  number  (ASN)  which 
he  states  might  hold  under  certain  conditions.  These  conjectural 
formulas  have  not  been  investigated.  It  appears  that  the  task  of 
solving  these  formulas  is  considerably  more  difficult  than  using 
Aroian's  direct  method  of  sequential  analysis,  as  is  done  here. 

t Also,  the  direct  method  uses  no  approximations  and  the  results 

c'  ■*  are  exact 

* 


l 


CHAPTER  2 


CONSTRUCTION  OF  THE  SEQUENTIAL  TEST  REGIONS 
FOR  TWO  AND  THREE  DECISION  TEST  PROCEDURES 
FOR  TI1E  HYPERGEOMETRIC  DISTRIBUTION 

2.0  INTRODUCTION 

In  this  chapter,  the  method  of  finding  a sequential  test 
region  for  the  hypergeometric  distribution  is  presented.  Tests 
are  given  to  determine  the  number  of  defectives  in  a finiite 
population  when  the  population  size  is  known.  In  the  first 
part  of  the  chapter,  one-sided,  two  decision  tests  are  treated. 

In  the  second  part,  the  method  is  extended  to  deal  with  two- 
sided,  three  decision  tests.  Numerical  examples  to  illustrate 
both  cases  are  provided.  The  third  section  discusses  sequential 
tests  of  composite  hypotheses  and  the  resulting  OC  functions. 

2.1  CONSTRUCTION  OF  THE  BOUNDARIES  FOR  A TWO  DECISION  SEQUENTIAL 
TEST 


The  sequential  procedures  developed  here  will  consider  the 
s tuation  whtrn  one  is  sampling  items  one  at  a time  without  re- 
placement from  a finite  population  of  known  size.  Each  item  is 
then  classified  as  a defect  or  as  a non-defect.  Based  on  the 
total  number  of  defects  observed,  a decision  is  made  to  accept  HQ, 
the  null  hypothesis,  or  H^,  the  alternate  hypothesis,  or  to  take 
another  observation.  The  procedure  is  easily  generalized  to  con- 
sider group  sampling  (inspecting  more  than  one  item  at  each  trial) , 
although  this  is  not  considered  here. 
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In  order  to  test  a one-sided  hypothesis,  the  simple 


hypotheses 


versus 


D=Dq<N/2 


D=Di>D0 


(2.1) 


are  specified  (if  Dg>N/2,  one  can  reverse  the  designation  of 
"defective"  and  "effective"  observations) , where  D is  the  number 
of  defectives  in  a population  of  size  N.  The  sequential  test 
procedure  for  distinguishing  between  these  two  hypotheses  is 
as  follows.  Following  Wald  (1947),  a sequential  probability 
ratio  test  (SPRT)  is  carried  out  by  calculating  the  likelihood 
ratio  at  each  trial.  Let  y^=l  if  a defect  is  observed  at  trial  i 
and  0 otherwise  and  let  f(y,D)  denote  the  probability  of  ob- 
serving a defect  when  there  are  D defects  remaining  in  the  popu- 

n 


lation.  The  likelihood  at  step  n with  a sample  outcome  x= 
is  then  equal  to 


E 

i=l 


Ln(x,D.)=£(y1/D.1)-f(y2,D.2)...f(yn,Din)  (2.2) 


and  the  likelihood  ratio  for  the  simple  hypothesis  in  (2.1)  is 


L„(JC'D1>  _ ^ ^1  ,D11^ " ^ ^y2  ,D12^  * * * £ (^n'Dln^ 

Ln(x'V  f(yrDoi)’f(y2'Do2)---f(VDon) 


(2.3) 


Here  D„  is  the  number  of  defectives  remaining  in  the  population 
at  trial  j under  hypothesis  and  is  dependent  on  what  has  hap- 
pened in  the  (j-1)  previous  trials.  Note  that  D^=D^  an<^ 
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D0=D00*  test  fchen  carried  out  using  the  following  pro- 

cedure: 


accept  Hq 

if 

L (x  ,D.  ) 
n 1 

<B, 

VX'V 

accept  H^ 

if 

Ln(x’V 

>A, 

(2.4) 

VX'D0> 

take  another  sample  if  _ . L (X/D, ) _ 

c B < n 1 <A 


VX'V 

The  values  A and  B,  which  are  needed  for  the  test,  are  quite 
difficult  to  determine  exactly.  However,  the  approximate  values 

A~ (1-3) /a  B-B/  (1-a)  (2.5) 

given  by  Wald  (1947)  serve  the  purpose  well  (this  is  demonstrated 
in  the  examples  to  follow).  Here  a is  the  desired  probability 
of  a Type  I error  and  B is  the  desired  probability  of  a Type  II 
error,  as  explained  in  Chapter  1.  It  can  be  shown,  for  example, 
that  u'+B'ia+B  where  a'  and  B'  are  the  error  probabilities 
actually  given  by  the  SPRT. 

In  the  case  of  a discrete  distribution,  the  likelihood  ratio 
of  a given  sample  is  independent  of  the  particular  order  in  which 
that  sample  is  observed.  Therefore,  the  likelihood  ratio  at  each 
trial  is  easily  computed  as  the  ratio  of  the  probabilities  of 
obtaining  the  observed  sample.  In  this  case. 


L„(X'D0> 


(2.6) 
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After  expanding  the  binomial  coefficients,  the  likelihood  ratio 
can  be  expressed  as 


L (x,  D ) _ D '(N-D.)!  (D  -x) ' (N-D.-n+x) l 
n 1 — X X U u / <2  *7  \ 

Ln(x,D0)  Dq!  (N-Dq)  I * (D1-x)  ! (N-D1-n+x)  ! 17 

where  the  factor  on  the  left  is  independent  of  both  the  sample 
outcome  and  the  trial  number. 

Some  authors  (e.g. , Ghosh  (1970)  and  Chung  (1950)  ) have 
given  approximations  to  the  likelihood  ratio.  These  approxima- 
tions are  seen  to  be  unnecessary,  as  the  expression  in  (2.7)  is 
simply  and  efficiently  evaluated  by  using  a table  look-up  of 
log  factorials  within  the  computer  program  used  to  compute  the 
test  region.  The  program  given  in  the  Appendix  uses  this  method 
The  log  likelihood  ratio  is  computed  as 


iln 


Ln(x,Dl) 

Ln(x'V 


= KO^DQ^mn  ^(D0-x)  !^+£n  ^(N-DQ-n+x) 
-Jin  ^(D^-x)  -Jin  ^ (N-D^n+x) 

^(N-D1)  -Jin  ^(N-Dq)  !^. 


(2.8) 


where  K (N , Dq  , ) = f n (D^ ! ) - ?.n  (Dq  ! ) + 2.n 

To  carry  out  the  test  £n{Ln  (XjD^)  /L^  (x^q)}  is  computed  at  each 
trial  and  compared  with  a=Jin  (A)  and  b=£n(B). 

There  are  two  interesting  peculiarities  which  arise  in  the 
treatment  of  sequential  tests  of  the  hypergeometric  distribution 
first,  because  we  define  =0  if  x>n,  the  likelihood  ratio  can 


take  on  the  following  values 
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Ln ( * ' D X ) 

V*~V 


Ln(x,D3.) 

Vx'Do> 


if  x>D 


0 


(2.9) 


= 0 


if  x<n+D^-N 


In  the  first  case,  more  than  DQ  defectives  have  been  found 
and  therefore  HQ  cannot  be  true  and  is  accepted.  In  the 
second  case,  not  enough  items  remain  in  the  population  to  ever 
accept  H^*  therefore,  is  accepted. 

The  other  interesting  characteristic  of  an  SPRT  for  the 
hypergeor.ietric  distribution  is  that  the  test  is  always  closed 
at  some  finite  trial  number.  That  is,  there  is  a natural  trun- 
cation point  where  the  test  is  terminated.  This  occurs  because 
the  finite  population  is  depleted  by  sampling.  In  cases  of 
independent  observations  (e.g. , the  binomial  or  Poisson  dis- 
tribution) , no  such  natural  truncation  exists  and  the  test  is 
shown  to  be  closed  only  as  n+«>  (Wald  ,1947). 

An  upper  bound  for  this  natural  truncation  point  is 
N— Dj+Dg+1,  although  it  can  be  considerably  less  depending  on  the 
size  of  the  desired  error  probabilities  (a  and  6 errors) . This 
follows  directly  from  (2.9).  If  n=N-D^+Dg,  there  is  only  one 
value  of  x (x=DQ=n+D^-N)  such  that  neither  of  the  inequalities 
on  the  right  side  of  (2.9)  is  satisfied.  If  n>N-D^+D0  one  of 
these  inequalities  will  hold  for  each  value  of  x and  either  HQ 
or  must  be  accepted. 

In  order  to  carry  out  the  sequential  procedure  in  practice, 
it  is  usually  easier  to  have  ayailable  upper  and  lower  (integer) 


21 


limits  on  the  number  of  defectives  necessary  for  a decision 
(one  way  or  the  other)  at  each  trial.  If  we  let  cL(n)  denote 
tiie  lower  limit  and  c^fn)  the  upper  limit,  at  trial  n the 
sequential  test  procedure  becomes: 

accept  H if  x<cT  (n) 

0 L (2.10) 

accept  if  x>c^ (n) 

where  x is  the  number  of  defectives  observed  at  trial  n.  If 
neither  of  these  inequalities  holds,  another  sample  is  taken. 

If  possible,  the  values  c. (n)  and  cM(n)  are  obtained  by 
inversion  of  the  equations 

b=g  (x,D0,D1,n,N)  = ?.n 


Ln(x,D0) 

by  solving  for  x.  The  values  cT  and  c are  then  expressed  as 

J-»  U 

°L<n>=  [q"1(b,D0,D1,n,N)]  (2.12) 

Cu(n)=  fg  ^ {o,D0,D1(n,N)l  +1 

where  K=[R]  signifies  the  largest  integer  value  K such  that 
-1 

K<R  and  g is  the  inverse  function  of  g when  solving  for  x. 
Because  of  the  factorials  in  the  function  g,  these  functions 
must  be  inverted  numerically.  This  is  a simple  procedure 
because  at  each  step  one  has  very  close  lower  bounds  for  the 


a=g  (x,DQ,D1,n,N)=Kn 


VX'V 


Ln(x-V 


new  critical  values.  That  is,  the  value  of  the  inverse  function 


x=g"1(b,D(),D1/n-l)£g"1  (b,DQ,D1,n) . 


(2.13) 


Also,  the  values  in  (2.13)  are  generally  very  close  together. 

As  a result,  each  critical  value  in  (2.12)  is  usually  obtained 
with  only  one  or  two  evaluations  of  the  log  likelihood  ratio. 

The  critical  values  cT  (n) , c..(n),  n=l,2,...n**  define  the 
critical  regions  for  the  test,  where  n**  denotes  the  natural 
truncation  point.  An  example  of  such  a region  is  shown  graphically 
in  Figure  2.1.  Note  that  at  trial  n,  if  cL(n)<0,  no  decision  in 
favor  of  HQ  is  possible  and  if  Cy(n)>n,  no  decision  for  is 
possible. 


Accept  H1 


Continue 

Sampling 


Accept  Hq 


Trial  Number 


Figure  2.1  Typical  Two  Decision  Test  Region 

for  the  Hypergeometric 
Distribution 
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A computer  program  given  in  the  Appendix  carries  out  the  above 
calculations  to  compute  the  desired  sequential  test  regions  for 
any  desired  values  of  N,  , Dg,  a and  8. 

Often  n**,  the  natural  truncation  point,  is  much  larger  than 
n*,  the  sample  si2e  of  the  corresponding  fixed  size  sample  test. 
This  undesirable  situation  indicates  that  with  positive  proba- 
bility, the  test  will  require  samples  which  exceed  the  necessary 
sample  size  of  the  corresponding  fixed  size  sample  test.  Because 
this  probability  is  rather  small  and  because  this  probability 
(of  such  a large  sample)  is  largest  when  the  parameter  being 
tested  is  actually  in  the  "indifference  range"  (Dg<D<D^) , the 
test  can  usually  be  improved  by  truncation  at  some  point  nQ  n*. 
More  complete  treatment  of  this  subject,  along  with  suggested 
truncation  procedures,  is  given  in  Chapter  3. 

A numerical  example  of  the  above  procedure  for  determining 
the  sequential  test  region  is  now  given.  The  hypotheses  to  be 
tested  are 


H 

versus  H 


0: 

1: 


D=D 


D=D1=40 


(2.14) 


with  a population  size  N=100.  The  desired  error  probabilities 
are  a=0.05  and  8=0.10.  The  following  quantities  are  computed. 


b=?.n  (B)  = £n  10. 1/  (1-0. 05)  J =-2.25129 
a=£n  (A)  = £n  ( (1-0 . 1) /0 . 05  ^ = 2 • 89037 


l)  =-2.: 


(2.15) 


)=&n(40!)-ln(25i)  + ln  ^(LC0-4  0)  !^-v'.n  ^(100-25) 


=-10.9452 


As  an  example,  we  compute  the  critical  limits  at  trial  n=32 
The  log  likelihood  ratio  at  trial  32  with  x defectives  observed 


in  the  sample  is 


£n 


L32(X/D1} 


L32 ^X/Dq^ 


=K(  100,25 , 40)+£n  ^(25-x)  !^+£n  ^(75-32+x) 
~£n ^(40-x)  !^-£r. ^(60-32+x) 


(2.16) 


and  is  tabulated  for  different  values  of  x in  Table  2.1. 


Table  2.1 

Log  Likelihood  Ratio 
for  Different  Values  of  x 
at  Tria)  32 


x 


L32  (x'Di* 
L32(x'V 


-3.26 

-2.31 

-1.34 

-0.34 


0.68 

1.72 

2.80 

3.91 

5.08 


It  is  easily  seen  from  this  table  that  for  values  of  x>14, 
the  log  likelihood  ratio  is  greater  than  a=£n(A)  and  therefore 
such  points  belong  in  the  accept  region.  For  values  of  x<8, 
the  log  likelihood  ratio  is  less  than  b=£n(B)  and  therefore  these 
points  belong  in  tte  accept  region. 

When  the  above  procedure  is  carried  out  for  each  trial, 
one  obtains  the  critical  values  needed  for  the  sequential  test. 
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This  was  done  for  the  example  and  the  results  are  given  in  Table 
2.2.  A graphical  presentation  of  the  region  is  given  in  Figure 
2.2.  One  should  notice  that  at  trial  67,  cT  (n) +l=cri  (n)  . This 
implies  that  a decision  must  be  made  for  either  HQ  or  H^  and 
that  the  test  will  not  continue  past  this  point.  This  is  the 
natural  truncation  point  mentioned  earlier.  Note  that  this  value 
is  considerably  smaller  than  n**,  the  upper  limit  (N-D1+DQ+1=86) 
on  the  natural  truncation  point. 

2.2  CONSTRUCTION  OF  THE  BOUNDARIES  FOR  A THREE  DECISION 

SEQUENTIAL  TEST 

In  this  section,  the  procedure  for  developing  three  decision 
sequential  tests  for  the  number  of  defectives  in  a finite  popu- 
lation of  size  N is  given.  Three  decision  tests  are  often  nec- 
essary in  practice.  This  is  also  true  for  acceptance  sampling. 

One  example  of  their  use  would  be  when  one  must  distinguish  among 
lots  of  items  which  are  of  superior  quality  (for  which  some  in- 
centive bonus  might  be  given) , standard  quality  and  substandard 
quality.  The  tests  given  below  are  suitable  for  such  applications. 

A discussion  of  the  previous  work  concerning  sequential  three 
decision  tests  is  deferred  until  the  end  of  this  section,  after 
which  the  reader  will  be  more  familiar  with  the  subject. 

The  method  used  for  determining  the  test  procedure  is  a 
direct  extension  of  the  method  given  for  the  two  decision  test 
presented  in  the  first  part  of  this  chapter.  The  numerical 
example  given  previously  will  be  extended  to  the  three  decision 


case. 


Table  2.2 

Critical  Values  for  the 
Two  Decision  Example 


Trial  n 

cL(n) 

°D(n) 

Trial  n 

cL(n) 

cu( 

1 

* 

* 

42 

11 

16 

2 

* 

* 

43 

11 

17 

3 

* 

* 

■i/I 

12 

17 

4 

* 

* 

45 

12 

17 

5 

* 

* 

46 

13 

17 

6 

* 

6 

47 

13 

18 

7 

* 

7 

48 

13 

18 

8 

* 

7 

49 

14 

18 

9 

* 

7 

50 

14 

19 

10 

0 

7 

51 

14 

19 

11 

0 

8 

52 

15 

19 

12 

0 

8 

53 

15 

19 

13 

1 

8 

54 

15 

20 

14 

1 

9 

55 

16 

20 

15 

1 

9 

56 

16 

20 

16 

2 

9 

57 

16 

21 

17 

2 

9 

58 

17 

21 

18 

3 

10 

59 

17 

21 

19 

3 

10 

60 

17 

21 

20 

3 

10 

61 

18 

22 

21 

4 

10 

62 

18 

22 

22 

4 

11 

63 

19 

22 

23 

4 

11 

64 

19 

22 

24 

5 

11 

65 

19 

23 

25 

5 

12 

66 

20 

23 

26 

5 

12 

67 

20 

23 

27 

6 

12 

68 

20 

23 

28 

6 

12 

69 

21 

24 

29 

6 

13 

70 

21 

24 

30 

7 

13 

71 

21 

24 

31 

7 

13 

72 

22 

24 

32 

8 

14 

73 

22 

25 

33 

8 

14 

74 

22 

25 

O/t 

-J  *T 

8 

14 

75 

23 

25 

35 

9 

14 

76 

23 

25 

36 

9 

15 

77 

23 

26 

37 

9 

15 

78 

24 

26 

38 

10 

15 

79 

24 

26 

39 

10 

16 

80 

24 

26 

40 

10 

16 

81 

25 

26 

41 

11 

16 

27 
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For  a three  decision  test  the  hypotheses  are  specified  as 
H1:  D=D1<N/2 

versus  H^:  D=Dg>D^  (2.17) 

versus  H2:  D=D2>DQ 

where  D is  the  number  of  defectives  in  a population  of  size  N. 

At  each  trial,  an  item  is  selected  at  random  without  replacement 
from  the  population  and  classified  as  either  a defect  or  a non- 
defect and  one  of  four  actions  is  taken.  Either  one  of  the  three 
hypotheses  is  accepted  or  another  sample  is  taken.  It  is  the 
purpose  of  this  section  to  develop  the  rules  for  carrying  out  such 
a test. 

To  devise  a three  decision  test,  a modified  version  of  the 
Sobel-Wald  procedure  (Sobel  and  Wald,  1949)  is  used.  Following 
their  treatment,  two  SPRTs  are  used  simultaneously.  One  SPRT,  say 
SPRT1,  is  used  to  distinguish  between  and  H^.  The  other  SPRT, 
say  SPF.T2,  is  used  to  distinguish  between  HQ  and  H2.  As  explained 
in  Chapter  1,  there  are  now  four  types  of  errors  to  be  concerned 
with.  As  before,  these  errors  are  denoted  a^,  6^  (for  SPRT1)  and 
a,,  B2  (for  SPRT2) . These  two  SPRTs  are  used  to  derive  the  three 
decision  test  as  follows.  At  each  step,  calculate  the  two  likeli- 
hood ratios  and  follow  the  rules: 
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accept  if 

L (x,D  ) 
n 0 < n 

L (x,D. ) 51  “l 
n l 

accept  H2  if 


and 


VX'D2>  « 

iTTSTV- 


B, 


L (x,D  ) 

_2 y_  > a 

L (X,D.)  " A1 
n 1 


and 


Ln(x'V  . . 

VX'D0>  ' 2 


accept  Hq  if 


L (x,Dn) 

2 P , - > a 

L (x,  D ) ' A1 
n 1 


and 


Ln;x-D2> 

Ln(x'D0> 


< 


B, 


(2.18) 


otherwise,  another  sample  is  taken.  Here  again  the  values 

A1  . Aj-d-Bjl/Oj 

(2.19) 

Bl3“l/(1-6l)  B2-B2/(l-a2) 

are  used  to  approximate  the  true  values  A^,  B^ , A2,  and  B2 
necessary  for  the  test.  These  approximations,  as  will  be  seen 
from  the  numerical  results  in  the  next  chapter,  are  satisfactory 
and  provide  good  tests. 

When  carrying  out  the  sequential  test  in  practice,  it  is 
usually  easier  to  have  available  . itical  limits  on  the  number 
of  defectives  necessary  to  accept  one  of  the  hypotheses  at  each 
trial.  For  a two-sided  test,  we  must  specify  four  critical  limits, 
two  for  each  SPRT  at  each  trial.  Let  c (n)  and  c (n)  denote  the 

Li  U 

lower  and  upper  limits  respectively  for  SPRT1  at  trial  n.  Also, 

let  dT  (n)  and  dM(n)  denote  the  same  critical  limits  for  SPRT2. 

Jj  u 
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The  test  procedure  then  becomes 


accept 

H1 

if 

x<cL (n) 

and 

x<dL(n) 

accept 

Ho 

if 

Cy (n)  >x 

->dL 

(n) 

accept 

H2 

if 

x>cu(n) 

and 

x>d  (n) 
U 

(2.20) 


and  otherwise  take  another  sample. 

Following  the  same  procedure  given  in  the  first  part  of 
this  chapter,  these  critical  values  are  found  by  inverting  the 
likelihood  ratio  equations  and  may  be  expressed  as 


cL(n)  = [g"1(b1,D1/D(),n,N)J 

cu(n)  = [g_1(a1,D1,D0,n/N)]  +1 

dL(n)  = [g“1(b2,D0,D2,n,N)] 

du(n)  = [g_1(a2,D0,D2,n,H)]  +1 


(2.21) 


where  a^=£n(A1),  b^=£n(B^),  etc.  and  the  other  notation  is  the 
same  as  that  used  in  (2.12).  The  critical  limits  for  a typical 
three  decision  test  are  shown  graphically  in  Figure  2.3. 


Number 

of 

Defects 


Figure  2.3  Typica'  Three  Decision  Test  Region 

for  the  Hypergeometric 
Distribution 


A computer  program  which  is  given  in  the  Appendix  calculates 
these  critical  points  separately  for  each  SPRT.  The  region  thus 
found  can  be  evaluated  using  the  method  and  computer  program 
described  in  Chapter  3.  The  special  characterisitics  of  the  two 
decision  test,  such  as  the  natural  truncation  point,  are  also 
present  in  the  three  decision  test.  Truncation  of  these  tests 
is  considered  in  the  next  chapter. 

We  now  extend  the  example  of  the  test  given  in  (2.14)  and 
consider  a three  decision  test.  There,  the  hypotheses  considered 
were 


V D=D(f25 

versus  H, : D=D  =40 


(2.22) 
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Now  the  hypotheses  being  tested  are: 


Hl:  D=Di=1° 


versus  HQ:  D=DQ=25 


versus  H D=D2=40 


again  with  a population  of  size  N=100. 
bilities  are 


(2.23) 


The  desired  error  probe - 


a1=0.05/  61=0.10/  a2=0.05  and  B2=0.10 


We  first  obtain  the  values 


b1=en(B1)=£n(0.05/(l-0.1)  ) = -2.89037 

a1=An(A1)=£n  ( (1-0. 05) /0. 1 ) = 2.25129  (2.24) 

b2=Hn(B2)=£n  (0 . 1/ (1-0 . 05)  ) = -2.25129 
a2=in(A2)=J»n  ((1-0.1  )/0. 05)  = 2.39037 

We  now  calculate  the  limits  in  (2.21)  for  trial  n=52.  The  values 
of  the  log  likelihood  ratio  for  the  two  SPRTs  are  given  in  Tabic'  2.3. 
It  can  be  seen  from  this  table  that  one  should  accept  if  x<_7, 
accept  Hq  if  10<x<15  and  accept  H2  if  x>19,  otherwise  another  il-em 
is  inspected.  If  this  procedure  is  carried  out  for  each  trial  up 
to  n**,  the  critical  values  for  the  three  decision  test  given  in 
Table  2.4  will  be  obtained.  A graphical  presentation  of  the'  tost 
regions  for  this  test  is  given  in  Figure  2.4. 
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Table  2.3 

Log  Likelihood  Ratio 
for  Different  Values  of  x 
at  Trial  52 


X 

L52 (x,Do) 

L52  *X,°2^ 

L52(X,D1) 

L52(X'V 

6 

-5.46 

-14.12 

7 

-3.50 

-12.85 

8 

-1.13 

-11.58 

9 

1.21 

-10.31 

10 

4.36 

- 9.05 

11 

CO 

- 7.77 

12 

CO 

- 6.48 

13 

CO 

- 5.18 

14 

CO 

- 3.85 

15 

CO 

- 2.48 

16 

CO 

- 1.08 

17 

CO 

0.37 

18 

CO 

1.88 

19 

00 

3.47 

20 

CO  | 

5.15 

The  following  is  a brief  sketch  of  the  different  approaches 
to  throe  decision  tests  which  have  been  treated  in  the  litera- 
ture. The  discussion  here  is  general  in  that  it  pertains  to  no 
specific  distribution.  Ghosh  (1970)  and  Goss  (1974b)  give  excel- 
lent and  somewhat  more  comprehensive  treatment  of  this  subject. 

No  attempt  has  been  made  to  cover  the  many  applications  of  these 
tests.  For  this,  the  reader  is  referred  to  Wetherill  (1966). 

Wald  (1947),  in  his  book,  gives  a method  of  formulating  a 
two-sided  test  by  using  weight  functions.  Barnard  (1947),  in  his 
review  of  Wald's  book,  mentions  an  alternate  method  which  simply 
tests  the  null  hypotheses  separately,  against  the  two  alternatives, 
This  is  dene  by  using  two  SPRTs  at  one  time.  The  resulting  test 
regions  are  shown  geometrically  in  Figure  2.5. 
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Sobel  and  Wald  (1949)  , in  their  paper,  treat  the  three 
decision  test  in  detail.  They  use  a test  similar  to  that  sug- 
gested by  Barnard.  The  difference  is  that  each  SPRT  is  treated 
independently  of  the  other.  This  would  mean,  for  example,  that 
when  line  AB  is  crossed  by  the  path  shown  in  Figure  2.6,  wc  no 
longer  allow  acceptance  of  and  concern  ourselves  only  with 
the  results  of  SPRT2.  Thus,  is  accepted  when  line  AC  is 
crossed  at  point  p,  before  a shaded  region  is  even  reached. 

Sobel  and  Wald  hasten  to  point  out  that  such  a test,  which  de- 
pends not  only  on  the  total  sample  results,  but  also  on  the 
sample  path  (order  of  the  observations) , cannot  be  an  optimal 
one.  However,  the  test  was  used  in  their  case  because  the  inde- 
pendence of  the  two  tests  enabled  the  authors  to  derive  approx- 
imations for  some  of  the  properties  of  this  three  decision  test. 

The  Sobel-Wald  tests  and  their  approximate  properties  are  treated 
in  detail  by  Ghosh  (1970) . Here,  we  use  the  direct  method  of  se- 
quential analysis  which  can  be  used  to  find  the  exact  properties 
of  any  specified  sequential  test  region. 

Goss  (1974b),  when  treatinq  three  decision  sequential  tests 
of  the  mean  of  a normal  distribution,  compared  the  Sobel-Wald 
test  with  the  Barnard  test.  He  used  the  direct  method  to  obtain 
exact  test  results  for  such  tests.  From  his  results,  (as  one  would 
expect  intuitively)  it  is  seen  that  the  test  with  independently 
run  SPRTs  has  a smaller  expected  sample  size,  but  slightly  larger 
error  probabilities.  The  differences,  however,  are  quite  small. 

For  this  reason  and  because  it  has  somewhat  more  intuitive  appeal. 
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the  approach  suggested  by  Sobel  and  Wald  is  used  here,  with  the 
modification  that  a decision  to  accept  a hypothesis  is  allowed 
if  and  only  if  one  of  the  conditions  in  (2.20)  is  satisfied; 
that  is,  if  and  only  if  one  enters  a shaded  region  in  Figure  2.6. 

Another  approach  to  the  three  decision  test  is  given  by 
Arimitage  (1950) . In  this  paper,  Armitage  suggests  using  three 
SPRTs  simultaneously.  The  three  SPRTs  are  constructed  to  dis- 
tinguish between  and  Hq,  and  and  between  and  II 2> 

This  is  shown  graphically  in  Figure  2.7. 

In  tests  where  a^+a2<3^+82/  (which  is  the  case  in  most 
practical  applications) , the  test  will  be  almost  identical  to 
the  Sobel  and  Wald  type  regions  used  above.  If,  however, 
a1+a2>61+B2'  regions  such  as  the  ones  shown  in  Figure  2.8  are 
obtained.  In  such  cases,  the  method  of  Armitage  might  be  worth 
using.  The  test  regions  would  be  similar  to  those  shown  in 
Figure  2.7.  The  computer  program  given  in  the  Appendix  is 
general  and  may  be  used  to  evaluate  such  regions  if  desired. 

2.3  TEST  OF  COMPOSITE  HYPOTHESES  AND  THE  OC  FUNCTION 

This  section  will  consider  sequential  tests  of  composite 
hypotheses.  It  will  be  shown  here  that  the  Wald  SPRT,  used  in 
Sections  2.1  and  2.2  and  based  on  pairs  of  simple  hypotheses, 
can  be  used  to  obtain  satisfactory  sequential  tests  for  composite 
hypotheses.  The  discussion  below  pertains  to  two  decision  tests, 
although  the  ideas  also  apply  to  k>2  decision  tests. 

When  finding  a fixed  size  sample  test  to  choose  between  one 
of  two  specified  hypotheses,  one  must  specify  both  the  sample  size 
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n*  and  critical  value  c*  to  give  the  desired  error  probabilities, 
as  explained  in  Section  1.3  (randomization,  of  course,  can  also 
be  used  in  the  test) . When  this  special  case  is  generalized  to 
a sequential  procedure  where  stopping  rules  are  selected  for 
each  trial,  the  problem  of  selection  of  the  proper  test  becomes 
much  more  complicated  because  there  are  many  more  possible  tests 
to  choose  from.  To  find  a sequential  test,  one  must  choose  an 
upper  and  a lower  limit  for  the  number  of  defectives  at  each  trial 
number  and  possibly  nQ,  a truncation  point  for  the  sequential  test. 

It  is  well  known  that  the  Wald  SPRT  gives  optimum  regions 
for  testing  a simple  hypothesis  against  a simple  alternative  under 
certain  conditions  (Wald  and  Wolfowitz,  1948).  Such  hypotheses 
are  stated,  for  example,  as 


V D=D0 


versus  : D=D^ 


(2.25) 


as  shown  in  Figure  2.9.  The  hypotheses  are  represented  as  points 
if  they  are  simple,  as  in  this  case,  and  as  line  segments  if  they 

Simple  Hypotheses 


;0  1 
Composite  Hypotheses 


D. 


Figure  2.9  Simple  and  Composite  Hypotheses 
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are  composite.  For  our  purpose,  we  define  simple  and  composite 
hypotheses  to  be  hypotheses  with  exactly  one  point,  and  more 
than  one  point  respectively.  Statistical  tests  between  two 
alternative  simple  hypotheses  imply  that  the  experimenter  be- 
lieves that  there  are  only  two  possible  values  for  the  true 
state  of  nature.  Such  situations  do  not  often  occur  in  practice. 

In  most  cases  the  hypotheses  to  be  tested  are  composite  and 
expressed  in  a form  similar  to 


or 


V 

a 

it 

D 

o 

versus 

V 

d^o 

(2.26) 

V 

D*Do 

versus 

Hr 

DiDl>D0 

(2.27) 

When  using  a statistical  test,  the  important  distinction  between 
the  simple  hypotheses  in  (2.25)  and  the  composite  hypotheses  of 
(2.26)  and  (2.27)  is  that  in  the  latter  one  is  interested  in  all 
of  the  points  of  the  OC  function  over  a specified  range  of  the 
parameter  values  given  by  the  hypothesized  states  of  nature. 

The  hypotheses  shown  in  (2.26)  do  not  contain  any  specific 
alternative  and  are  the  types  generally  specified  in  so-called 
"tests  of  signif icar.ce.  " Users  of  such  tests  generally  use  a 
specified  significance  level  (a  error)  and  sample  size,  but  do 
not  mention  a specific  alternative  hypothesis  and  therefore 
often  do  not  consider  the  "power"  of  their  tests.  The  rationale 
for  such  a test  is  that  there  is  a strong  prior  belief  in  (or 
preference  for)  the  null  hypothesis  and  that  it  is  not  to  be 
rejected  unless  there  is  strong  evidence  (i.e., 
fidence  level)  that  it  is  not  true. 


at  the  1-u  con- 


42 


By  examining  the  Type  II  error  (which  is  one  minus  the 
power  of  the  test  at  a specified  alternative)  one  can  determine 
if  the  significance  level  of  the  test  has  been  set  too  low  (or 
too  high)  for  a given  sample  size  or  if  the  sample  size  is  much 
too  large  (or  too  small)  for  the  required  sensitivity  against 
alternatives  to  the  null  hypothesis.  Either  of  these  consequences 
could  be  costly.  It  does  no  harm  for  even  the  "significance 
tester"  to  investigate  to  which  alternatives  his  test  will  be 
sensitive.  From  this  it  is  seen  that  it  is  important  to  examine 
the  power  of  a statistical  test. 

In  this  light,  the  pair  of  hypotheses  in  (2.27)  is  considered. 
Here  a range  of  values  has  been  specified  for  , the  alternative 
hypothesis,  as  well  as  for  HQ,  the  null  hypothesis  (see  Figure  2.9). 
The  values  in  between  DQ  and  constitute  an  "indifference  zone." 
For  the  situation  where  one  must  make  a decision  for  either  Hq 
or  for  , and  where  there  are  positive  costs  (tangible  or  not) 
for  both  types  of  errors,  this  is  a more  practicable  way  of  speci- 
fying the  hypotheses  to  be  tested. 

This  again  brings  out  the  subtle  difference  between  a "test 
of  significance"  and  other  composite  tests  of  hypotheses.  A test 
of  significance  might  be  valid,  for  example,  for  a test  used  in 
proving  some  law  of  nature,  for  which  it  is  nearly  impossible  to 
specify  all  of  the  possible  alternatives.  In  contrast,  when 
testing  the  ability  of  a new  drug  to  cure  a disease,  for  example, 
the  situation  is  different. 

If  the  proportion  of  successful  cures  of  a drug  is  to  be 
compared  with  that  of  a control  or  a placebo,  the  hypotheses  to 
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be  tested  will  usually  be  stated  as  H^:  P^=P2  vs*  : ^1<^>2' 
where  and  are  the  probabilities  of  a successful  cure  for 
the  control  and  the  drug  being  tested  respectively  (both  being 
unknown).  In  this  case,  there  are  true  costs  (although  they  are 
probably  intangible)  for  both  types  of  errors;  that  is,  for 

■k 

accepting  the  new  drug  as  "significantly  better"  when  it  is  not 
and  for  rejecting  it  when  it  is  "significantly  better."  Because 
both  of  these  errors  are  important,  it  is  imperative  that  the 
experimenter  examine  the  power  of  his  statistical  test  so  that 
the  errors  can  be  balanced  if  necessary.  These  same  ideas  are 
important  in  the  development  of  sequential  tests  of  composite 
hypotheses. 

When  developing  sequential  tests,  it  is  usually  necessary 
to  specify  some  specific  alternative (s)  to  the  *ull  hypothesis. 

Tnis  is  so  that  proper  stopping  rules  can  be  formulated  to 
control  both  types  of  errors  and  so  that  the  test  properties  of 
the  sequential  test  can  be  assessed.  If  one  wishes  to  test  a 
composite  hypothesis  s 'ch  a s (2.27),  we  must  find  a sequential 
test  procedure  which  has  a satisfactc  y OC  function  over  a speci- 
fied range  of  parameter  values.  This  is  usually  done  with  respect 
to  some  additional  criterion  concerning  the  cost  of  sampling. 

Although  the  Wald  procedure  provides  optimal  tests  under 

k k 

certain  conditions,  there  remains  the  problem  of  finding  optimum 
sequential  tests  for  the  composite  hypotheses  consideied  here. 

* 

Her r mean  a difference  of  practical  significance,  rather  than 

simp  a difference  of  statistical  significance. 

* * 

The  criterion  for  optimality  is  left  open  for  now.  More  treatment 

is  given  to  this  subject  in  Section  3.2. 
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In  Sections  2.1  and  2.2  sequential  test  regions  were  found  by 
specifying  simple  hypotheses.  Wald  (1947)  discusses,  this  problem 
at  some  length.  He  comes  to  the  conclusion  that  the  test  of  the 
simple  hypotnesis  in  (2.25)  can  be  used  to  approximate  a test  of 
a composite  hypothesis  such  as  (2.27)  without  much  loss  of  effi- 
ciency. This  is  the  method  most  commonly  used  to  find  regions 
for  a sequential  test  of  a composite  hypothesis. 

One  should  examine  the  possible  consequences  of  using  such 
an  approximation,  that  is,  carefully  examine  the  OC  function  of 
the  test.  If  the  resulting  OC  function  is  not  close  to  the 
desired  OC  function,  the  test  region  can  be  modified  so  that  il 
is.  The  numerical  examples  given  in  Chapters  4 and  5 will  show 
how  this  is  done  by,  for  example,  comparing  the  OC  function  of 
a fixed  size  test  with  that  of  a sequential  test.  Although  no 
claim  of  optimality  is  made  for  the  above  tests,  a procedure  for 
finding  such  optimal  or  near  optimal  tests  is  outlined  in 


Section  3.2. 


CHAPTER  3 


EVALUATION  OF  THE  TEST  REGIONS  USING  THE  DIRECT  METHOD 

OF  SEQUENTIAL  ANALYSIS 


3.0  INTRODUCTION 

This  chapter  describes  the  evaluation  of  the  sequential 
test  regions  for  the  hypergeometric  distribution.  In  the  first 
section,  the  direct  method  of  sequential  analysis  is  introduced. 

It  is  this  method  which  is  used  to  find  the  exact  properties  of 
the  sequential  test  regions  given  in  Chapter  2.  Section  3.2 
explains  how  truncation  of  the  regions  can  be  used  to  improve 
the  properties  of  a sequential  test  and  suggests  procedures  for 
doing  this.  The  following  two  sections  explain  in  detail  how 
the  direct  method  is  used  to  obtain  the  test  properties  for  the 
two  and  three  decision  sequential  test  regions  developed  in 
Chapter  2.  Numerical  examples  for  each  of  these  cases  are  also 
given. 

3.1  THE  DIRECT  METHOD  OF  SEQUENTIAL  ANALYSIS 

The  direct  method  of  sequential  analysis,  given  by  Aroian 
(1968),  describes  a general  method  whereby  the  exact  properties 
of  a given  sequential  test  region  may  be  obtained.  Since 
Aroian's  1968  article,  the  method  has  been  used  in  a variety  of 
applications,  including  tests  for  the  mean  of  a normal  distribu- 
tion with  the  standard  deviation  known  (Aroian  and  Robison,  1969), 
and  unknown  (Schmee,  1974);  two-sided  tests  of  the  normal  dis- 
tribution with  the  standard  deviation  known  (Goss,  1974b), 
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sequential  rank  tests  (Elfring  and  Schultz,  1973);  tests  of 
the  binomial  distribution  (Corneliussen  and  Ladd,  )970  and  1971) 
and  tests  of  the  variance  of  a normal  distribution  with  mean  known 
or  unknown  (Aroian,  Gorge,  Goss  and  Robison,  1975). 

Before  using  a sequential  test  procedure,  one  should  know 
or  have  available  reasonable  approximations  to  the  actual  test 
properties.  The  most  important  test  properties  are  the  true  a 
and  6 error  probabilities  (denoted  a'  and  p ' here)  and  the  ex- 
pected or  average  sample  number  (ASN) , which  is  a function  of 
the  true  state  of  nature.  A typical  ASN  function  is  shown  in 
Figure  3.1.  Also  of  interest  is  the  operating  character  i :;t.  i e (Of) 
function  which  gives  the  probability  of  accepting  as  a function 
of  che  true  state  of  nature  (in  this  case,  D,  the  actual  number 
of  defectives  in  the  population).  A typical  OC  function  is  shown 
in  Figure  3.2.  The  true  a and  p error  probabilities  for  a two 
decision  test  are  obtained  directly  from  the  OC  function  as 


ct ' = 1-0C(Dq) 

8'  = OC(D1) 


(3.1) 


Approximations  to  the  OC  and  ASN  functions  are  given  by  Wald 
(1947).  These  approximations  are  valid  only  if  the  observations 
are  independent.  This  is  not  the  case  with  the  hypergeomei r i c 
distribution.  Also,  even  if  the  restriction  does  hold,  the 
adequacy  of  these  approximations  varies  from  test  to  test.  The 
direct  method  of  sequential  analysis  as  explained  below,  will 
allow  one  to  find  both  the  OC  and  ASN  functions  exactly. 


Figure  3.3  Typical  Distribution  of  the  DSN 


48 


Another  interesting  test  characteristic,  which  is  often 
neglected,  is  the  distribution  of  the  decisive  sample  number 
(DSN),  that  is,  the  probability  mass  function  of  the  sample  size 
required  to  7ome  to  a decision.  This  distribution  is  also  a 
function  of  the  true  state  of  nature.  From  this  distribution, 
one  can  obtain  the  ASN,  the  variance  of  the  sample  number  (VSN) 
or  other  moments.  The  direct  method  is  also  used  to  find  the 
distribution  of  the  DSN.  A typical  probability  mass  function  for 
the  DSN  is  shown  in  Figure  3.3. 

In  general,  the  direct  method  is  carried  out  as  follows. 

Once  the  sequential  test  region  has  been  specified,  one  then 
chooses  a state  of  nature  which  allows  the  computation  of  the 
probability  of  accepting  each  possible  hypothesis  at  the  first 
trial.  The  remaining  probability,  that  is,  the  probability  of 
being  in  the  continuation  region,  is  spread  out  among  all  of 
the  possible  values  of  the  sample  statistic  which  are  included 
in  the  continuation  region.  At  the  second  trial,  another  sample 
is  taken.  It  is  again  necessary  to  find  the  probability  of 
accepting  each  hypothesis  and  the  distribution  of  probability  of 
remaining  in  the  continuation  region.  Using  convolutions,  one 
can  continue  this  process  at  each  succeeding  trial  until  the  t<*s( 
is  truncated  or  until  the  probability  of  continuation  is  so  sin, ill 
as  to  be  insignificant.  This  procedure  is  then  repeated  using 
different  values  for  the  state  of  nature,  each  giving  a point  on 
the  OC  function  and  a distribution  of  the  DSN.  This  is  done  for 
the  hypergeometric  distribution  in  Sections  3.3  and  3.4. 
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3.2  TRUNCATION  OF  THE  SEQUENTIAI  TEST  REGION 

One  disadvantage  of  using  sequential  test  procedures  is  that 
because  the  sample  size  is  a random  variable,  it  is  sometimes 
possible  (usually  with  small  probability)  for  the  sample  size  to 
be  significantly  larger  than  the  sample  size  necessary  for  a 
fixed  size  sample  test  (n*).  This  section  presents  met-hods  for 
truncating  sequential  tests  at  some  trial,  say  nQ.  This  will 
result  in  a closed  sequential  test  whose  test  properties,  with 
respect  to  the  ASN  function,  will  be  much  improved.  The  price 
paid  for  this  improvement,  as  shown  in  Chapter  4,  is  usually 
quite  small. 

Wald  and  Wolfowitz  (1948)  show  that  a SPRT  for  a simple 
hypothesis  with  a simple  alternative  and  with  i.i.d.  observa- 
tions, has  the  smallest  ASN  (at  the  parameter  values  specified 
by  the  simple  hypotheses)  of  all  other  tests  with  the  same  a and 
8 error  probabilities.  This  of  course  also  implies  that  the  ASN 
of  the  SPRT  will  be  smaller  than  the  sample  size  of  the  corre- 
sponding fixed  size  sample  test.  It  must  be  remembered,  however, 
that  this  is  guaranteed  to  be  true  only  for  the  parameter  values 
specified  by  the  simple  hypotheses.  For  parameter  values  which 
fall  between  these  two  values,  the  value  of  the  ASN  of  the  untrun- 
cated test  may  even  rise  above  n*.  This  means  that  for  some  val- 
ues of  the  true  state  of  nature,  the  ASN  of  the  sequential  test 
will  be  greater  than  the  sample  size  required  for  the  fixed  size 
test.  It  should  be  noted  that  this  occurs  for  those  values  of 
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the  parameter  which  are  in  the  so-called  "indifference  range." 
This  is  shown  in  a graph  of  a typical  ASN  function  in  Figure  3.1. 
Truncation  of  the  sequential  test  at  some  trial  number,  say  nQ, 
is  often  used  to  help  alleviate  these  problems.  It  has  been 
shown  and  will  be  further  demonstrated  here  that  truncation  of 
sequential  tests  will  both  eliminate  the  possibility  of  an  ex- 
tremely large  sample  and  significantly  reduce  the  ASN  over  the 
space  of  the  parameter  value. 

It  is  to  be  expected  that  some  price  must  be  paid  for  this 
improvement  in  the  test,  which  is  indeed  the  case  here.  After 
truncation  of  a Wald-type  SPRT,  the  true  a and  0 error  probabi  1- 
ities  will  usually  increase  somewhat.  (In  fact,  the  entire  OC 
function  will  change.)  This  increase  is  usually  quite  small 
because  the  probability  of  such  large  samples  is  highest  near 
the  middle  of  the  "indifference  range"  and  relatively  small  near 
the  values  of  the  parameters  specified  in  the  simple  hypotheses. 
Because  the  true  a and  8 error  probabilities  of  the  untruncated 
SPRT  are  often  smaller  than  the  specified  error  probabilities 
(i.e.,  a'<a  and  8'<0)/  a small  increase  in  these  probabilities 
can  usually  be  tolerated.  Further  modification  of  the  region 
near  the  truncation  trial  number  (n^)  can  be  used  to  adjust  th<'se 
probabilities  to  be  quite  close  to  their  desired  values. 

Often  when  truncation  procedures  are  put  forward,  the  trun- 
cation point  suggested  is  from  1.5  to  3 times  n*  (e.g.,  Wald, 


(1947)).  This  is  probably  because  in  the  past,  very  lift  It'  was 
known  about  the  exact  properties  of  such  untruncated  tests.  When 
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using  the  direct  method,  however,  this  presents  no  problem 
because  the  direct  method  is  general  and  can  be  used  to  evaluate 
any  specified  test  region.  From  the  numerical  examples  given 
in  Chapter  4,  it  will  be  seen  that  the  truncation  point  can  bo 
moved  much  closer  to  n*,  while  still  keeping  a'~u  and  fi '"(<• 
Procedures  for  truncating  sequential  tests  of  the  hypergeometric 
distribution  and  for  comparing  alternate  tests  are  discussed  next. 

Truncated  sequential  tests  presented  here  will  be  truncated 
at  n*  (i.e.  nQ=n*) . If  the  desired  error  probabilities  cannot  be 
achieved  when  truncating  at  n*,  and  if  it  is  necessary  to  do  so, 
nQ  can  be  moved  one  way  or  the  other  to  help  achieve  the  desired 
error  probabilities  (e.g.,  nQ  should  be  increased  to  decrease  tilt' 
true  error  probabilities). 

Once  the  trial  number  where  the  test  is  to  be  truncated  has 
been  specified  (i.e.,  n^),  it  becomes  necessary  to  determine'  wh.it 
shape  the  test  region  should  have  around  nQ.  This  has  been  a much 
debated  topic  and  is  treated  at  some  length  by  Goss  for  the  three 
decision  normal  distribution  sequential  test  (Goss,  1974b)  and  for 
the  binomial  distribution  sequential  test  (Goss,  1974a).  There 
he  compares  and  contrasts  the  "right  angle"  truncation  and  the 
"wedge"  type  of  truncation  illustrated  in  Figure  3.4.  He  finds 
that  while  the  differences  are  small,  the  wedge  truncation  has  a 
slightly  lower  ASN  and  slightly  larger  error  probabilities. 
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Figure  3.4  Wedge  and  Right  Angle  Truncation 


I 
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When  comparing  different  types  of  truncation,  or  to  be  more 
general,  different  types  of  test  regions,  one  usually  specifies 
some  sort  of  optimality  criteria  with  which  alternative  tests 
(i.e.,  different  test  regions)  can  be  compared.  This  can  also 
be  used  to  optimize  a test  procedure.  Two  examples  of  such  cri- 
teria are: 

1)  Minimizing  the  maximum  of  the  ASN  function  over 
the  parameter  space. 

2)  Minimizing  the  expected  sample  number  using  a 
prior  distribution  for  the  parameter (s)  being 
estimated. 


Monahan  (1973)  uses  Bayesian  decision  theory  with  a specified 
loss  function  to  construct  "admissible"  truncated  sequential 
test  regions  for  testing  the  mean  of  a normal  distribution. 
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When  comparing  alternate  tests,  however,  it  is  usually 
necessary  to  take  into  account  the  differences  in  the  error 
probabilities  (or  the  OC  function) . This  can  be  done  in  one  of 
two  ways.  First,  the  randomization  scheme  described  in  Section 
1.3  can  be  applied  to  the  sequential  test  procedure  and/or 
modification  of  the  regions  can  be  used  to  adjust  the  error  prob- 
abilities. Also,  an  objective  function  can  be  constructed  which 
takes  the  size  of  the  error  probabilities  (or  to  be  more  general, 
the  OC  function)  into  consideration.  The  comparisons  made  in 
Chapter  5 will  be  made  between  tests  which  have  approximately  tin- 
desired  a and  ft  probabilities.  In  most  cases  this  will  not  ad- 
versely affect  the  validity  of  the  comparisons  which  are  made. 

When  truncating  a discrete  distribution  such  as  the  binomial 
or  the  hypergeometric  considered  here,  there  are  only  a finite 
number  (which  may  be  rather  large)  of  sensible  regions  to  use  for 
a given  test  procedure.  In  the  truncated  reqions  considered  here, 
the  Wald  regions  developed  in  Section  2.3  are  used  with  tin  trun- 
cation rules  given  next. 

With  sequential  tests  of  the  hypcrgeometric  (or  any  other) 
distribution,  one  should  truncate  the  test  such  that  there  art-  no 
points  in  the  continuation  region  from  which  only  one  decision  can 
be  made.  Such  points  can  only  increase  the  ASN  function  and  do 
not  affect  the  error  probabilities  and  therefore  should  be  made 
part  of  the  region  for  accepting  the  appropriate  hypothesis. 

Point  p in  Figure  3.5a  is  an  example  of  such  a point.  If  the 
above  rule  is  followed  when  truncating  a typical  Wald  region. 


Trial  Number  n n( 


Figure  3.5a  Improper  Truncation 


Trial  Number  n n 


Figure  3.5b  Proper  Truncation 


the  continuation  region  will  have  a horizontal  upper  boundary 
and  45°  lower  boundary.  This  is  shown  in  Figure  3.5b.  Ono 
should  note  that  this  is  the  only  sensible  method  of  truncation 
for  dichotomouo  distributions.  When  truncating  a sequential  test 
for  a continuous  distribution,  the  solution  to  this  problem  is 
not  so  clear,  as  there  are  an  infinite  number  of  points  in  the 
sample  space.  The  problem  of  "optimal  truncation"  needs  to  be 
more  fully  explored  for  these  cases. 

When  truncating  a sequential  test  of  the  hypergeometric  dis- 
tribution, there  are  two  decisions  which  must  be  made.  First, 
one  must  decide  the  trial  number  where  the  test  is  to  be  truncated. 
As  mentioned  earlier,  this  can  usually  be  at  or  near  n* , the?  sample 
size  of  the  fixed  size  sample  test.  This  truncation  point  is 
denoted  nQ.  In  addition,  it  is  also  necessary  to  speci ~y  the 
critical  value  for  the  sequential  test  at  this  trial.  That  is, 
one  must  determine  the  proper  cM  (nn)=cT  (n. ) +1 . Using  the  chosen 
nQ  and  Cy{n0)  and  the  rules  given  above  with  the  Wald  regions 
given  in  Section  2.1,  the  truncated  test  procedure  is  completely 
specified. 

It  may,  under  certain  circumstances,  be  desirable  to  furl  her 
modify  the  Wald  regions.  This  can  be  done  if  one  follows  the 
rules : 


Cy  (n+1) 


c (n+1) 

li 


Cy (n) +1 
CL(n) 


or 


(3.2) 
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which  are  a generalization  of  the  truncation  rules  given  above 
and  should  be  true  of  all  critical  values  which  define  the  test 
region.  These  rules  simply  state  that  the  upper  and  lower  bound- 
ary points  do  not  decrease  with  n and  never  increase  by  more  than 
one  at  any  given  trial. 

In  order  to  make  the  changes  suggested  above,  one  should 
know  the  effect  of  the  different  types  of  modification.  Those 
are  outlined  in  Table  3.1. 


Table  3.1 
Effects  of  Region 
Modification* 


OC  (D) 

ASN(D) 

increase 

cu(n) 

decrease 

increase 

decrease 

cu(n) 

increase 

decrease 

increase 

cL<n) 

increase 

decrease 

decrease 

cL(n) 

decrease 

increase 

Note,  for  example,  if  an  increase  is  indicated  in 
this  table,  the  function  in  some  cases  will  remain 
the  same,  but  will  not  decrease. 


The  regions  obtained  by  using  the  above  procedure  must 
ultimately  be  judged  on  the  basis  of  their  exact  test  properties, 
which  can  be  found  by  using  the  direct  method  as  shown  in  the 
next  section.  This  usually  leads  to  an  iterative  procedure  to 
find  the  proper  test.  Such  a procedure  begins  by  evaluating  a 
suggested  test  region  to  find  its  test  properties.  If  the  test 
properties  are  not  satisfactory,  the  test  region  is  modified 
using  the  suggestions  above  and  evaluated  again  using  the  direct 
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method.  Several  such  iterations  may  be  necessary  to  achieve 
the  desired  test  properties.  This  procedure  is  illustrated 
with  the  numerical  examples  presented  in  Chapter  4. 

When  truncating  a k>2  decision  test,  it  is  necessary  to 
truncate  each  of  the  SPRTs  separately.  This  is  illustrated  in 
the  example  given  in  Section  3.4. 

3.3  OBTAINING  THE  TEST  PROPERTIES  OF  A TWO  DECISION  TEST  REGION 

This  section  will  explain  how  the  direct  method  of  sequen- 
tial analysis  is  used  to  find  the  exact  test  properties  for  a 
two  decision  test  of  the  hypergeometric  distribution.  It  will 
be  shown  how  one  can  obtain  both  the  OC  function  and  the  dis- 
tribution of  the  decisive  sample  number  (DSN).  From  these,  one 

can  also  find  the  average  sample  number  (ASN)  and  the  true*  <t 
and  8 error  probabilities,  a'  and  The  two  decision  test  region 

developed  in  Section  2.1  will  be  evaluated  here  as  a numerical 
example . 

As  explained  in  Section  3.1,  the  direct  method  is  used  by 

computing  both  the  probability  of  making  each  decision  and  the 

distribution  of  probability  remaining  in  the  continuation  region 

at  eacn  trial.  The  probabilities  at  tria!  n+i  are  computed  by 

convoluting  the  probability  remaining  in  the  continuation  region 

at  trial  n with  the  sample  taken  at  trial  n+1.  This  is  done  for 

each  trial  for  n = l,  2,....  n . In  a discrete  distribution  smell 

0 

as  the  hypergeometric,  this  entails  . umming  probabi 1 i t ios  at  each 
trial  of  the  test.  For  the  hypergeometric  distribution,  this  is 
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illustrated  with  the  grid  shown  in  Figure  3.6.  The  critical 
values  which  define  this  test  region  are  given  in  Table  3.2. 

For  illustrative  purposes,  this  region  has  been  truncated 
at  trial  10,  using  the  rules  given  in  Section  3.2.  That  is, 
the  critical  values  cr  (n)  and  c„  (n)  are  non-decreasing  with  n 
and  never  increase  by  more  than  one  at  any  given  trial.  The 
probability  of  reaching  each  point  which  is  in  the  region  or 
on  its  boundary  is  a function  of  the  true  state  of  nature  (true 
number  of  defectives  in  the  population,  D) . These  probabilities 
differ  from  those  of  the  corresponding  hypergeometric  distribu- 
tion (i.e.,  the  probability  with  a fixed  size  sample  test)  only 
because  of  the  difference  in  the  number  of  paths  available  to 
reach  a given  point.  The  probabilities  are  computed  recursively 
starting  with  the  point  at  which  the  origin,  when  no  samples  have' 
to  be  observed;  the  probability  of  this  point  is,  of  course,  1. 
The  recursive  formula  used  to  compute  the  probabilities  at-  each 
trial  is: 


P (x, J,n+1 ,N) =1 (x,n)P(x;D,n,N)» (N-n-D+x) / O-n) 

+1 (x-l,n)P (x-l,D,n,N) . (D-x^ 1 )/(N-n) 

(3.  3) 


where 

P(x,D,0,N)  = 

I (x, n)  = 

The  indicator  function  accounts  for  the 
minates  when  one  of  the  critical  points 


| 0 otherwise 
1 c.  (n)vxn  cTI(n) 

ii  u 

0 otherwise 
fact  that  the  test 
is  reached. 


t er- 


ritical  Limits 


Figure  3.6  Probability  Grid  for  a Two  Decision  Sequential  Test 
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There  are  two  possible  outcomes  at  the  next  trial  from  ouch 
point  in  the  continuation  region;  each  inspected  item  may  bo 
either  a defect  or  a non-defect.  This  is  also  illustrated  in 
Figure  3.6.  When  a critical  point  cT  (n)  or  c..(n)  is  reached 
at  trial  n,  the  test  stops  and  a decision  is  made  in  favor  of 
either  or  H^.  Thus,  the  probability  that  the  test  terminates 
at  trial  n is  the  probability  of  reaching  one  of  the  critical 
points  at  trial  n. 

Referring  to  Figure  3.6,  an  X at  a point  indicates  a crit- 
ical point  for  acceptance  of  (c^fn));  an  @ signifies  a 
critical  point  for  acceptance  of  (c^  (n) ) . One  should  note  that 
some  critical  values  can  never  be  reached  (e.g.,  0 defectives 
observed  at  trial  4 and  4 defectives  observed  at  trial  6).  This 
is  because  there  are  no  paths  leaving  the  critical  points. 

After  the  probabilities  for  the  points  shown  on  the  grid 

in  Figure  3.6  have  all  been  computed,  it  is  an  easy  matter  to 

determine  the  exact  test  properties.  Let  AO  , A1  and  C denote 

n n n 

the  events  of  accepting  HQ,  accepting  H ^ and  continuing  to  trial 
n+1  respectively.  The  probabilities  of  Al^  and  AO^  at  each  trial 
are  then  the  probabilities  of  reaching  points  c (n)  and  c.  (n) 
respectively. 

This  can  also  be  expressed  as  follows: 

P(Ain,D)=  Z.Ji  (x,n)P(x;D,n,N)  (3.4) 

where  T , , _ (l  if  (x,n)  > Ai 

|0  otherwise 

Of  course,  P(C  ,D)  = P (C  .,D)-P(A0  ,D)-P(A1  , l) ) . 

n n- 1 n n 


|UWW"»IH«H|''*I'*I  wi',11  UJ 1 114.1  will  ■■III  ipy  manway  iihiiiww^.  ■ ^ii'” 
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Another  interesting  way  to  look  at  the  direct  method  is  to 
consider  the  sequential  process  a Markov  chain  with  absorbing 
states.  Each  point  in  the  sample  space  is  a state  and  each  bound- 
ary point  is  an  absorbing  state.  The  transition  probabilities 
from  each  state  are  a function  of  the  true  state  of  nature.  In 
order  to  use  the  direct  method,  it  becomes  necessary  to  find  the 
probability  of  absorption  in  each  of  the  absorbing  states.  This 
will  give  P(AOn;D)  and  P(Aln;D)  for  n=0,l,...no-  Madsen  (1974) 
uses  a Markov  chain  technique  to  find  the  properties  of  a truncated 
SPRT . 

The  distribution  of  the  decisive  sample  number  (DSN)  (i.e., 
the  probability  of  the  test  terminating  at  trial  n)  can  be  ex- 
pressed as  follows: 


P (n;D)=P  (AO  U A1  ;D) 

n n o.<> 

=P  (AO  ;D)+P(A1  ; D) 
n n 

This  is  computed  for  each  n up  to  n^,  the  first  trial  where 
cL (n) +l=Cy (n) . This  is  the  truncation  point  of  the  test.  The 
ASN  and  VSN  are  then  computed  as 

n0 

ASN (D)  = E nP (n ; D)  (3.7) 

n=l 


VSN (D)  = l (n-ASN (D) j P (n;D) . (3.8) 

n=l 


The  ASN  can  also  be  expressed  as 
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This  alternate  form  is  given  by  Aroian  (1975)  and  shows 
how  the  ASN  function  "builds  up"  at  each  trial  of  the  sequential 
test  • 

The  computer  program  in  the  Appendix  computes  these  quan- 

t h 

tities.  If  desired,  the  k moment  about  the  origin  can  be 
computed  as 

n0 

E(nk,D)  = T.  nkP(n;D)  (3.10) 

n=l 


The  OC  function  of  the  test  is  computed  as 


n. 


0C(D)  = Y.  P (AO  ; D) 
n=l 


(3.11) 


As  mentioned  earlier,  the  exact  a and  8 errors  are 


u'  = 1-  OC (Dq ) 
8'  = OC(Dx)  . 


(3.12) 


The  OC  function  is  also  calculated  by  the  computer  urogram  given 
in  the  Appendix. 

The  above  properties  have  been  computed  for  the  test  region 
obtained  in  Section  2.1  and  truncated  as  in  Section  2 of  this 
chapter.  Because  of  space  limitations,  the  distribution  of  the 
DSN  is  shown  for  only  one  value  of  D.  These  are  shown  in  Tables 
3.3  and  3.4.  Graphs  of  the  OC  and  ASN  functions  for  the  test  .in- 
shown  in  Figures  3.7  and  3.8.  These  properties  are  typical  of 
most  sequential  tests  of  the  hypergeometric  distribution.  A com- 
plete discussion  of  these  test  properties  is  deferred  until 
Chapters  4 and  5,  when  a more  complete  examination  of  some  numer- 
ical examples  is  presented. 


Tabic  3.3 

Distribution  of  the  DSN  for  the  Two  Decision  Example 
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P ( HO  ? 

P { Ml  > 

P<T) 

PCCi 

TRIAL  p < HO ) 

P ( HI ) 

P ( T J 

PtC) 

6 

0.00000 

0.00050 

0,00050 

0.99950 

44 

0.03031 

0.00365 
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0.30477 

7 

0.00000 

0:00000 

O.'OOOOO 
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45 

0.00000 

0.0074? 

0.00749 

0.29/28 

6 

0.00000 
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46 
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0, 0113 j 

0,05191 
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9 
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0.00000 
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10 
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16 
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*0.00000 
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39 
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0.00000 
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-7 

O'.OOOOO 
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40 

0.00000 

0.00329 

0XP0324 
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O'.  00174 
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0.00123 

41 
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7? 
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42 
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O'.OOOOO 
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81 
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Table  3.4 

Properties  of  the  Two  Decision  Test  Example 
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0,084706 

0,915294 

29,7936 

*5 

0|0i94i5 
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3.4  OBTAINING  THE  TEST  PROPERTIES  OF  A THREE  DECISION  TEST  REGION 

This  section  will  explain  how  the  direct  method  of  sequen- 
tial analysis  is  used  to  find  the  exact  test  properties  for  a 
three  decision  sequential  test  for  the  hypergeometric  distr Lbution . 
The  procedure  is  easily  extended  to  tests  with  k'3  decisions. 

The  same  test  characteristics  found  for  the  test  in  Section  3.3 
are  calculated  for  the  three  decision  sequential  test  region  de- 
veloped in  Section  2.2. 

The  direct  method,  as  applied  here,  is  very  similar  to  the 

method  used  for  the  two  decision  test  treated  in  Section  3.3. 

Now,  however,  there  are  four  possible  events  at  each  trial.  The 

events  are  Al  , AO  , A2  and  c , denoting  acceptance'  of  il.,  II  , 
n n n n v 10 

H2  at  trial  n and  continuing  to  trial  n+1,  respectively.  As  with 
the  tv/o  decision  test,  it  is  necessary  to  compute  the  probability 
of  each  of  these  events  for  each  trial  n=l , 2 , . . . , nQ , where  n(J  is 
either  the  truncation  point  or  that  point  past  which  the  proba- 
bility of  continuation  is  small  enough  to  ignore. 

The  probabilities  of  each  of  the  above  events  arc  computed 
in  a manner  similar  to  that  used  for  the  two  decision  test  region. 
One  must  again  compute  the  probability  of  reaching  each  point  in 
the  sample  space.  These  probabilities  differ  from  the  actual 
hypergeometric  distribution  only  because  of  the  di f foronoe  in 
the  number  of  paths  available  to  reach  a given  point.  The  dif- 
ference in  the  number  of  paths  is  due  to  the  stopping  rules  of 
the  particular  sequential  test. 
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Figure  3.9  shows  in  detail  the  grid  for  part  of  a typical 
region  for  a three  decision  test.  The  probability  of  reaching 
each  point  is  again  computed  recursively,  moving  from  one  trial 
to  the  next.  When  a point  in  the  continuation  region  is  reached, 
another  sample  is  taken,  illustrated  by  two  arrows  leaving  each 
of  these  points.  When  a critical  point  is  reached,  a decision  is 
made  in  favor  of  one  of  the  three  hypotheses  and  the  test  is  ter- 
minated. The  recursive  formula  used  in  calculating  the  proba- 
bilities is: 


P(x,D,n+l,N)=I(x,n)P(x;D,n,N) ( (N-n-D+x) / (N-n) ) 

+1 (x-1, n) P (x-l;D,n,N) ( (D-x+1)/ (N-n) ) 


( 3 . J 3 ) 


where 


P (x , D,  0 ,N  ) = 


if  X=0 
otherwise 


I (x,n) 


1 if  (n,x)iC 

n 

0 otherwise 


The  indicator  function  I again  accounts  for  the  fact  that 

the  test  ends  when  one  of  the  critical  points  is  reached. 

After  al)  of  these  probabilities  have  been  computed,  the 

probabilities  of  the  events  Al  , AO  , A2  at  each  trial  are  then 

n n n 

the  probabilities  of  reaching  each  of  the  boundary  points.  That  is, 


P (Ai  ) = £ J . (x,n)  P (x;D , n ,N) 

n x 1 

Ji(x,n)  = 


(3.14) 


where 


1 if  (x,n)i  Ai 

n 

0 otherwise 


f-  Jl 


I 
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is  an  indicator  function  used  to  sum  the  probabilities  of  the 
proper  hypotheses. 

Once  these  probabilities  have  been  found,  we  can  compute 
the  desired  test  properties  as  follows.  The  distribution  of  the 
DSN  (as  a function  of  D,  the  true  state  of  nature)  is 

P (n;D)  = P(AlnUAOnU  A2n;D)  (3.15) 

= P(A1  ;D)+P(A0  ;D)+P(A2  ;D) 
n n n 

In  a three  decision  test,  there  are  really  three  OC  functions, 
each  giving  the  probability  of  acceptance  for  one  of  the  hypotheses. 
The  OC  function  for  hypothesis  i is  found  as  follows: 

n0 

OC. (D)  = Z P (Ai  ; D)  (3.16) 

1 n=l  n 

The  ASN  and  VSN  of  the  test  (as  a function  of  D)  are  then 
computed  as 

no 

ASN  (D)  = Z nP (n ; D)  (3.17) 

n=l 

"°  2 

VSN  (D)  = Z (n-ASN(D))  P(n;D)  ( 3 - 18) 

n=l 

The  expression  for  the  ASN  in  (3.9)  is  also  applimble  here.  The 
computer  program  listed  in  the  Appendix  calculates  the  above 
properties  and  they  have  been  computed  for  the  sequential  test 
obtained  in  Section  2.2  and  truncated  as  in  Section  3.2  of  this 
chapter.  The  truncated  test  region  is  shown  in  Table  2.4.  The  Lost 
properties  are  shown  in  Table  3.5.  graphs  of  the  ASN  and  OC 
functions  are  shown  in  Figures  3.10  and  3.11. 
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Figure  3.9  Probability  Grid  for  the  Three  Decision  Example 


Graph  of  the  ASN  Function  for  the  Three  Decision  Example 
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CHAPTER  4 

FURTHER  NUMERICAL  EXAMPLES 


4 . 0 INTRODUCTION 

In  the  first  section  of  this  chapter,  further  examples 
of  two  decision  truncated  sequential  tests  for  the  hyper- 
geometric distribution  are  given.  Ten  example  test  plans  have 
been  chosen  and  completely  evaluated.  These  examples  have  been 
included  to  show  how  the  tests  perform  over  a range  of  different- 
parameter  values.  The  computer  program  which  is  listed  in  the 
Appendix  was  used  to  develop  the  test  regions  and  to  obtain  prop- 
erties of  the  tests  which  are  given  here.  This  computer  program 
can  be  used  to  develop  and  evaluate  tests  of  the  hypergeoinetr i e 
distribution  for  any  desired  parameter  values.  In  the  second 
section  of  this  chapter,  an  illustration  is  given  to  show  how 
these  sequential  tests  are  used  in  practice. 

4 . 1 TEST  PLAN  EXAMPLES 

All  of  the  examples  of  the  sequential  test  plans  which  are 
given  here  are  truncated  at  n*,  the  sample  size  of  the  fixed 
size  sample  test.  The  desired  error  probabilities  are  chosen  to 
be  a=0.05  and  £=0.10.  While  these  error  probability  limits  are 
not  met  in  all  cases  when  using  the  sequential  test,  the  true 
values  are  always  very  close  to  the  desired  values  and  could  have 
been  made  closer  following  the  region  modi f ication  procedure  dis- 
cussed in  Section  3.2. 
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Because  of  space  limitations,  the  distribution  of  the  DSN 
is  given  only  for  the  first  example.  For  this  example  and  the 
others  which  follow,  the  fixed  size  test  and  its  corresponding 
OC  function  are  given.  Also,  tables  of  the  OC  and  ASN  functions 
are  given  for  each  of  the  sequential  tests.  The  original  sequen- 
tial test  region  and  the  modifications  made  to  it  are  also  shown. 

In  Chapter  5,  using  the  information  given  here,  comparisons 
will  be  made  between  the  sequential  tests  and  both  the  fixed 
size  sample  tests  and  tests  which  use  the  binomial  distribution 
as  an  approximation.  There,  the  differences  among  the  OC 
functions  of  the  different  tests  are  examined  and  the  relative 
advantages  of  using  a sequential  procedure  is  shown.  The  graphs 
presented  here  will  aid  the  comparisons.  For  each  of  the  test 
plans,  three  OC  functions  have  been  graphed.  These  are  the  OC 
functions  for  the  fixed  size  test,  for  the  sequential  hyper- 
geometric distribution  test,  and  for  the  binomial  approximation 
to  the  hypergeometric  distribution  test.  The  ASN  function  for 
the  sequential  hypergeometric  distribution  tests  is  also  qraphed. 
The  horizontal  dotted  line  on  the  graphs  represents  the  sample 
size  of  the  fixed  size  tests. 

4.2  USING  THE  SEQUENTIAL  TEST  PLANS 

This  section  will  present  an  example  which  shows  how  the 
sequential  test  plans  are  used  in  practice.  The  example  used 
here  could  be  used,  for  example,  in  an  acceptance  sampling  scheme 
to  examine  lots  of  automobile  tires  when  they  arrive  from  the 
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Test  Plnn  7 


N«  190 
D0>  10 

Dls  25 
ALPHAS  0.050 
BETAS  0.100 

EVALUATION  or  RES  I OS  RE9UESTEO 


The  FIXED  SUE  TEST  IS  AS  TOLLOuSI 
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Tost  Plan  8 


N.  100 
DO*  15 
Dl*  30 
ALPHA.  0.050 
BETA*  0.100 

EVALUATION  OF  REGION  REQUESTED 
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Figure  4.9  Properties  for  Test  Plan  9 
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manufacturer.  Each  lot  contains  30  tires  and  it  is  desired  to 
accept  the  lot  if  it  contains  D<5  defectives  (0^=5)  and  to 
reject  the  lot  if  it  contains  d>15  defectives  (D^=15) . The  a 
and  3 errors  are  chosen  to  be  0.05  and  0.10  respectively.  For 
this  test.  Test  Plan  1 is  appropriate.  The  truncated  sequential 
test  region  and  the  test  properties  are  given  in  Section  4.1. 

To  carry  out  the  sequential  test,  the  following  procedure  is 
followed.  At  each  stage  of  the  test  a tire  is  selected  at 
random  (without  replacement)  from  those  remaining  in  the  lot. 

The  total  number  of  defectives  which  have  been  observed  is  then 
cc.apared  with  the  critical  limits  which  define  the  test  region . 
This  is  continued  until  one  of  the  critical  limits  is  reached. 

A typical  sequential  sample  which  might  be  obtained  for 
tnis  case  is  shown  in  Table  4.1,  along  with  the  critical  limits. 
This  is  shown  graphically  in  Figure  4.11.  For  this  particular 
sample,  only  1 defective  has  been  found  at  the  7th  trial.  A 
decision  is  therefore  made  in  favor  of  and  the  lot  is  accepted. 
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Figure  4.11  Graphical  View  of  the  Sequential  Test 


CHAPTER  5 


COMPARISON  WITH  OTHER  TESTS 


5.0  INTRODUCTION 

In  this  chapter,  two  types  of  comparisons  are  presented. 

In  the  first  section,  comparisons  between  the  fixed  size  sample 
and  the  sequential  tests  of  the  hypergeometric  distribution  are 
given.  In  the  second  section,  sequential  tests  for  the  hyper- 
geometric distribution  are  compared  with  similar  tests  for  the 
binomial  distribution.  The  purpose  of  the  first  section  is  to 
show  the  relative  superiority  (with  respect  to  the  expected  sample 
size)  of  the  truncated  sequential  test  when  cc spared  to  fixed 
size  sample  tests  for  the  same  hypothesis.  The  second  section 
shows  the  possible  consequences  of  using  Wald  type  regions  for 
the  binomial  distribution  to  develop  approximate  sequential  tests 
for  the  hypergeometric  distribution. 

5.1  COMPARISON  WITH  FIXED  SIZE  SAMPLE  TESTS 

In  this  section  the  fixed  size  tests  described  in  Section 
1.3  will  be  compared  and  contrasted  with  the  sequential  tests 
given  in  Section  2.1  and  truncated  as  in  Section  3.2.  There 
will  be  frequent  references  to  the  tables  and  graphs  given  in 
Chapter  4.  The  OC  functions  for  these  two  tests  will  be  examined 
along  with  the  ASN  function  of  the  truncated  sequential  test. 

From  this,  it  will  be  quite  easy  to  see  the  relative  gain  in 
efficiency  (with  respect  to  the  expected  sample  size)  obtained  by 
using  the  sequential  test. 
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The  fixed  size  tests  used  here  were  developed  as  described 
in  Section  1.3.  They  are  the  (non-randomized)  tests  with  minimum 
sample  size  such  that  the  limits  on  the  desired  error  probabil- 
ities are  met.  From  examination  of  the  OC  function  for  these 
fixed  size  tests  (these  OC  functions  and  their  graphs  are  given 
in  Section  4.1),  it  is  seen  that  these  error  probabilities  are 
usually  quite  a bit  smaller  than  the  desired  error  probabilities. 
This  will  have  to  be  taken  into  consideration  (in  a subjective 
manner)  when  making  the  comparisons  presented  here. 

The  graphs  for  each  example  test  plan  in  Section  4.1  show 
the  OC  functions  for  both  the  fixed  size  sample  test  (dashed  line) 
and  the  sequential  test  (solid  line) , as  well  as  for  the  binomial 
approximation  test  (dotted  line) , which  is  explained  in  the  next 
section. 

Table  5.1  gives  the  true  a and  B errors  for  each  of  the  tests, 
along  with  the  sample  size  for  the  fixed  size  test  (which  is  also 
the  truncation  point  for  the  sequential  test)  and  the  maximum 
value  of  the  ASN  function  for  the  sequential  case.  The  following 
observations  can  be  made  about  these  example  test  plans. 

The  most  important  advantage  gained  by  using  a sequential 
test  procedure  is  an  overall  reduction  in  the  amount  of  sampling 
required  to  come  to  a decision.  This  advantage  is  again  demon- 
strated here.  The  maximum  of  the  ASN  function  for  each  of  the 
test  plans  ranges  between  25  and  33  percent  below  the  sample  size 
of  the  fixed  size  test.  There  is  no  doubt  that  these  tests  will 
result  in  a considerable  saving  with  respect  to  sampling  costs. 
There  is,  however,  a small  price  to  be  paid  for  this  saving. 
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The  a and  8 error  probabilities  of  the  fixed  size  tests 
are  never  any  larger,  and  are  usually  somewhat  smaller,  than 
the  desired  error  limits.  The  a and  8 errors  for  the  sequential 
test,  while  somewhat  larger  than  those  of  the  fixed  size  test, 
are  in  all  cases  quite  close  and  rarely  above  the  desired  limits. 
Examination  of  the  graphs  of  the  OC  function  also  illustrates 
this  point.  From  the  graphs,  it  is  again  seen  that  the  error 
probabilities  have  increased  somewhat,  but  that  over  all,  the 
OC  curves  are  very  similar.  This  is  the  price  paid  for  the  sample 
size  advantage  described  in  the  last  paragraph. 

It  should  be  pointed  out  the!  randomized  tests  meeting  the 
required  error  probability  limits,  which  may  have  slightly  smaller 
sample  sizes  than  do  the  fixed  size  tests  presented  here,  can 
usually  be  found.  This  type  of  test  is  not  often  used  in  practice 
and  tends  to  be  somewhat  of  a theoretical  contrivance.  Even  when 
compared  to  such  randomized  tests,  the  sequential  tests  are  usually 
superior. 

5.2  COMPARISON  WITH  SEQUENTIAL  TESTS  OF  THE  BINOMIAL  DISTRIBUTION 

In  this  section,  the  Wald  regions  for  tests  of  the  binomial 
distribution  are  examined.  First,  the  physical  characteristics 
of  these  regions  are  compared  with  the  regions  for  the  hypergeometric 
distribution  which  were  found  in  Section  2.1.  Then  the  binomial 


regions  are  used  as  approximate  tests  for  the  hypergeometric  dis- 
tribution. 
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For  completeness,  the  well-known  method  of  determining  the 
regions  for  a sequential  test  of  the  binomial  distribution 
(Wald,  1947)  is  briefly  reviewed  here.  The  same  notation  devel- 
oped in  Chapter  2 will  be  used. 

The  probability  mass  function  of  the  binomial  distribution 
is 

f (x,n,p)=^pX(l-p)n  X 0<x<n  (5.1) 

This  is  the  limiting  distribution  of  the  hypergeometric  dis- 
tribution if  N approaches  infinity  and  D/N  remains  constant. 

The  likelihood  ratio  used  to  test  the  hypothesis 


V p=po 

versus  P^Pj/Pg 

x.,  ,n-x 
»n -x 

P0(1~P0) 


is 

L (x; p^ ,n) 
L(x;p0,n) 


(5.2) 


(5.3) 


where  x is  the  number  of  defects  observed  after  n trials.  Fol- 
lowing the  same  procedure  given  in  Chapter  2,  the  rules  for  the 
test  are 


accept  Hq  if  L (x;p1 , n) /L (x;pQ ,n) <B 

accept  if  L(x;p^,n)/L(x;pg,n) >A 


(5.4) 


and  otherwise  take  another  sample.  The  log  likelihood  ratio 
function  is  found  to  be 
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g (x,p0/p1,n)=£n(L(x,p1,n)/L(x/p0,n) ) 


(5.5) 


=x-£n  (p1/pQ)  + (n-x)  • £n  ( (1-p.^  / (l-pQ)  ) 
The  critical  values  for  the  log  likelihood  ratio  are  then 


b=£n(B)=£n(8/(l-ct) ) 
a=£n  (A) =£n ( (l-$) /a) 


(5.6) 


We  then  find  the  critical  values  for  acceptance  and  rejection  at 
each  trial  from  the  inverse  function  g ^ when  solving  for  x. 

The  equations  for  these  are 

kL(n)=  g "k  h^p^p^njJ  (5.7) 

(b-n£n(  (1-p^/ (l-p0)  )/in(p1  (l-p0)/(pQ  (l-p1)  ) )] 
[g"1(a/pC)/p1,n)]  +1 

(a-n£n  ( (1-p^ / (l-p0) ) /£n  ^ (1— px ) ) )J  +1 


k(J(n)  = 


where  K= [R]  is  again  the  greatest  integer  less  than  or  equal  to  [U] 
The  inverse  functions  in  (5.7)  are  linear  in  n and  can  there- 
fore be  represented  by  two  parallel  lines  with  slope 


s=(  (l-p0)/(l-p1)/£n(p1  (l-pQ)/(p0  (1-p^  ) ) 


(5.8) 


and  intercepts 


IL=£n(8/(l-a) )/£n (px (l-pQ)/ (pQ (l-p1) ) ) (5  g) 

ly=£n(  (l-B)/a)/£n  (pj^  (l-p0)/(pQ  (1-Pj ) ) ) 

An  example  of  these  two  lines,  which  define  the  boundary  of  the 
sequential  test  regions,  is  shown  in  Figure  5.1.  In  the  examples 
which  follow,  the  critical  limits  kT  (n)  and  k„(n)  are  computed 

Lt  U 


Figure  5.1  Typical  Wald  Region  for  the  Binomial  Distribution 

and  compared  with  the  critical  limits  of  the  sequential  tests  of 
the  hypergeometric  distribution,  cL(n)  and  Cy(n). 

When  using  the  binomial  test  regions  to  approximate  the  test 
for  the  hypergeometric  distribution,  one  must  make  a decision  as 
to  which  values  to  use  for  p^  and  p^.  Some  approximations  for 
hypergeometric  distribution  are  given,  for  example,  by  Johnson 
and  Kotz  (1369).  These  approximations  are  dependent  on  the  sample 
size  n and  their  use  in  a sequential  test  would  be  as  complicated 
as  the  exact  test  is.  For  this  reason  pQ  and  p^  are  taken  here 
to  be 


P0=VN 


P1=D1/N 


(5.10) 


For  large  N and  small  n,  these  are  good  approximations  and  would 
likely  be  the  values  used  in  practice.  Equation  (5.7}  is  used 
to  construct  the  binomial  test  regions,  with  the  following  modi- 
fications. First,  if  Dq+1  defects  are  observed,  the  test  is  ter- 
minated and  is  accepted,  as  it  is  then  known  that  HQ  cannot  be 
true.  Also,  the  test  is  truncated,  again  following  the  procedure 
given  in  Section  3.2.  No  other  modifications  are  made  to  the 
test  regions. 

Table  5.2  gives  the  critical  limits  for  both  the  truncated 
hypergeometric  test  plan  (cL(n)  and  Cy(n))  and  the  binomial  ap- 
proximation to  this  test  (k_  (n)  and  ktt(n))  for  Test  Plan  3 (from 
Section  4.1)  where  n=50,  DQ=2  and  D.^12.  This  same  information 
is  given  for  Test  Plan  5 in  Table  5.3. 

Because  these  two  distributions  are  the  same  at  the  first 
trial,  the  critical  values  will  be  the  same  for  n=l.  The  main 
difference  between  these  two  tests  is  that  the  hypergccm^tric 
distribution  test  regions  tend  to  shrink  in  width,  especially  from 
above.  The  average  distance  between  cT  (n)  and  c.,(n)  decreases 
with  n until  the  truncation  point  is  reached,  v,-»i le  the  average 
distance  between  k_  (n)  and  k„(n)  remains  the  same  except  at  the 
points  where  the  test  is  truncated.  The  shrinking  of  the  hyper- 
geometric distribution  test  region  is  due  to  the  finiteness  of 
the  population.  Also,  the  binomial  regions  usually  cend  to  be 
shifted  by  a small  amount  from  the  hypergeometric  test  region. 

The  effects  of  this  are  shown  in  the  numerical  results  to  follow. 


Binomial  and  Hypergeometric  Test  Regions 
for  Test  Plan  #3  (N=50,  DQ-2,  £>±=12) 
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Table  5.3 

Binomial  and  Hypergeometric  Test  Regions 
for  Test  Plan  #5  (N=50,  DQ=20,  D1=30) 
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Table  5.4  summarizes  the  results  of  using  the  binomial 
regions  as  an  approximation.  The  actual  a and  0 errors,  along 
with  the  maximum  values  of  the  ASN  function,  are  given  for  each 
test  plan  in  Section  4.1. 

It  is  seen  from  Table  5.4  that  in  almost  all  of  the  examples, 
either  the  true  a or  the  true  0 error  is  quite  a bit  larger  than 
what  was  desired  and  that  error  in  the  opposite  direction  is 
smaller  than  what  was  desired.  This  can  also  be  seen  in  the 
shifts  in  the  graphs  of  the  OC  function  for  the  approximate  tests, 
as  shown  in  Section  4.1.  This  seems  to  indicate  that  the  test 
approximation  is,  in  a sense,  biased.  There  seems  to  be  no  sig- 
nificant difference  between  the  maximum  values  of  the  ASN  function 
for  these  tests.  This  is  reasonable,  as  the  tests  are  truncated 
at  the  same  trial. 

The  above  observations  indicate  two  things  to  the  user  of 
these  sequential  tests.  First,  the  binomial  regions  do  not, 
in  general  (except  for  cases  where  the  population  rize  N is  large 
when  compared  with  a typical  sample  size  of  the  test  procedure) , 
provide  adequate  tests  for  the  hypergeometric  distribution.  Also, 
the  importance  of  finding  the  exact  test  properties  to  compare 
with  the  desired  values  is  clearly  shown. 
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CHAPTER  6 


ESTIMATION  OF  THE  NUMBER  OF  DEFECTIVES 
AFTER  TERMINATION  OF  THE  SEQUENTIAL  TEST 

6.0  INTRODUCTION 

In  this  chapter,  a method  whereby  one  can  obtain  a point 
estimator  and/or  confidence  intervals  for  the  number  of  defectives 
in  a finite  population  is  discussed.  The  estimation  is  to  be 
performed  after  sequential  tests,  such  as  those  discussed  in 
Chapter  2,  have  been  terminated. 

In  general,  there  are  two  basic  types  of  sequential  estima- 
tion to  be  considered.  First,  there  is  the  problem  of  sampling 
sequentially  until  estimates  with  the  desired  degree  of  precision 
have  been  obtained.  This  might  be  expressed,  for  example,  by 
specifying  the  maximum  allowable  confidence  interval  length. 

Another  type  of  estimation  is  often  required  when  one  would  like 
to  estimate  the  parameter  in  question  after  completion  of  a se- 
quential hypothesis  test.  This  latter  type  of  estimation  is  con- 
sidered here. 

A brief  outline  of  the  history  of  sequential  estimation  is 
presented  here  first.  This  is  followed  by  a description  of  a 
general  method  of  estimation  (given  by  Schmee  (1974)  and  Goss 
(1974b) ) which  may  be  used  after  a sequential  test.  The  following 
sections  show  hov'  this  method  is  applied  to  sequential  tests  of  the 
hypergeometric  distribution.  A numerical  example  is  also  given. 
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6.1  HISTORY  OF  SEQUENTIAL  ESTIMATION 

This  section  presents  a brief  overview  of  some  of  the 
approaches  to  sequential  estimation  which  have  received  atten- 
tion in  the  past.  More  comprehensive  overviews  of  . quential 
estimation  are  given,  for  example,  by  Johnson  (1961),  Wetherill 
(1966),  Goss  (1974a)  and  Schmee  (1974). 

The  first  results  with  sequential  estimation  were  obtained 
by  Tweedie  (1945)  and  Haldane  (1945)  who  use  an  inverse  binomial 
sampling  technique.  Wald  (1947),  in  his  book,  gives  some  struc- 
ture to  the  problem  and  suggests  an  approach  which  he  admits  is 
not  optimal.  While  the  problem  was  not  solved  by  Wald,  his  work 
seems  to  have  led  to  the  later  results  given  by  Anscombe  (1953) 
and  Cox  (1952a  and  1952b).  These  procedures,  which  deal  with 
generalized  boundaries,  were  also  explored  by  Wolfowitz  (1946) , 
Blackwell  (1947),  Savage  (1947)  and  Knight  (1965).  Dixon  and 
Mood  (1948)  apply  the  "up-down"  method  to  sequential  estimation, 
and  Anscombe  Q953)  reviews  the  early  methods  of  sequential  es- 
timation. Most  of  these  early  efforts  were  primarily  aimed  at 
estimation  rather  than  hypothesis  testing  followed  by  estimation. 

Armitage,  in  his  discussion  of  Anscombe' s paper  (Anscombe, 
1953),  points  out  that  in  general,  the  suggested  sequential  esti- 
mation techniques  are  not  any  better  than  the  standard  fixed  size 
procedures.  He  also  stresses  the  need  for  methods  of  estimation 
to  be  performed  after  sequential  tests  of  hypotheses.  The  follow- 
ing is  a brief  review  of  the  work  which  has  been  done  with  such 
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Girschick  et  al.  (1946)  give  a simple  method  of  finding 
the  unique  unbiased  estimator  (UBE)  for  a binomial  SPRT.  For 
tests  of  the  binomial  parameter,  Armitage  (1958)  compares  the 
mean  square  error  of  the  maximum  likelihood  estimator  (which  is 
unbiased  in  a fixed  size  test,  but  not  necessarily  in  a sequen- 
tial test)  with  the  variance  of  the  UBE  and  gives  a method  of 
finding  confidence  intervals  which  meet  the  classical  probability 
statement,  and  which  are  dependent  on  the  stopping  rules.  Aroian 
and  Oksoy  (1972)  present  a Bayesian  procedure  for  estimation  and 
for  finding  confidence  intervals.  The  procedure  is  to  be  used 
at  the  completion  of  a sequential  life  test.  A generalization  of 
this  procedure  is  due  to  Schmee  (1974)  and  Goss  (1974c)  and  is 
the  basis  of  the  estimation  procedures  developed  here. 

To  complete  our  overview  of  sequential  estimation,  the  Bayes- 
ian approaches  should  be  mentioned.  This  subject  is  treated  by 
Wetherill  (1966)  and  DeGroot  (1970).  Box  and  Tiao  (1973)  discuss 
the  sequential  nature  of  Bayesian  methods  of  estimation. 

6.2  THE  GENERAL  METHOD  OF  SCHMME  AND  GOSS 

The  method  of  estimation  presented  here  is  similar  in  nature 
to  the  method  given  by  Aroian  and  Oksoy  (1972).  Schmee  (1974)  and 
Goss  (1974c) , in  their  respective  dissertations,  give  a general 
method  of  finding  point  estimates  and  confidence  intervals  which 
can  be  easily  applied  to  sequential  methods.  Schmee  (1974)  applies 
the  method  to  sequential  tests  of  the  mean  of  a normal  distribution 
with  variance  both  known  and  unknown.  Goss  treats  the  two-sided 
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test  for  the  mean  of  a normal  distribution  with  variance  known 
(Goss ,1974b)  and  tests  of  the  binomial  distribution  (Goss ,1974a ) . 

A description  of  the  method  follows;  it  is  applied  to  tests  of 
the  hypergeometric  distribution  in  the  following  sections. 

In  order  to  present  the  method  in  a general  fashion,  it  is 
best  to  consider  estimation  of  a continuous  parametex  from  a 
continuous  distribution  (e.g.,  the  mean  of  the  normal  distribution). 
When  estimation  is  to  be  performed  for  a discrete  parameter  or 
from  a discrete  population,  the  appropriate  integrals  will  become 
summations  and  probability  density  functions  will  become  proba- 
bility mass  functions. 

When  the  direct  method  of  sequential  estimation  is  used, 

one  computes  the  probability  of  reaching  or  crossing  each  point 

on  the  region's  boundary  (assuming  a test  in  discrete  time),  for 

different  values  of  the  true  state  of  nature.  The  method  for 

doing  this  is  treated  in  Chapter  3.  Let  P(An|0)  and  P(Rn|0) 

represent  the  probability  of  accepting  and  rejecting  HQ  respectively 

at  crial  n if  0 is  the  true  state  of  nature.  These  probabilities 

are  easily  found  by  using  the  direct  method.  If  we  take  a Bayesian 

view  of  the  situation  and  assume  a uniform  prior  for  0 (over  all 

possible  values  of  0),  a "pseudo-posterior"  (in  the  sense  that  it 

is  not  based  on  a sufficient  statistic  ) distribution  for  0 can  be 

found  by  using  Bayes'  Formula. 

P(Anl0) 

An]  ~ /P (A  | 0 ) dO 
0 n 

R._)  = P{RJ0) 
n /P (R  |0)d0 


p (0  I 


P(0 


(6.1) 


The  first  equation  is  used  if  the  hypothesis  is  accepted  at 
trial  n;  the  second  one  is  used  if  the  hypothesis  is  rejected 
there.  Although  this  method  is  valid  for  certain  distributions 
(e.g.,  the  binomial  and  hypergeometric),  there  can  be  both 
theoretical  and  practical  objections  to  it  in  other  cases. 

The  first  objection  is  that  if  there  is  more  than  one  out- 
come contained  in  the  events  A or  R , that  information  might 

n n 

be  lost  by  grouping  the  outcomes  into  one  event.  The  second 
objection  concerns  the  restriction  to  the  uniform  or  "non- 
informative"  prior  distribution.  This  is  especially  true  with  the 
so-called  "improper  prior"  which  ranges  between  plus  and  minus 
infinity  and  cannot  be  made  to  integrate  to  one  (Box  and  Tiao, 
1973).  Both  of  these  objections  are  eliminated  with  the  more 
general  method  developed  by  Schmee  and  Goss. 

The  general  method  makes  use  of  all  of  the  available  infor- 
mation and  allows  the  use  of  any  specified  prior  distribution. 

Let  P(0)  denote  the  prior  density  of  0 and  let  f(x,n|h)  denote 
the  density  of  x given  0,  where  x is  the  sequential  sample  ob- 
tained up  to  trial  n (where  is  either  accepted  or  rejected) 
and  O is  the  true  state  of  nature  to  be  estimated.  If  T(x) 
is  a sufficient  statistic  for  0,  no  information  is  lost  by  con- 
sidering instead  the  density  g(T(x),n|0).  Note  that  this  density 
considers  all  possible  values  of  the  sufficient  statistic  which 
are  possible  at  the  termination  of  the  sequential  test.  The 
density  g(t(x),n|0),  is  obtained  via  the  direct  method.  The 
posterior  is  then  computed  as 
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P (0  |T  (x)  n)=  g(T,(x)  ,.n|,e)p(8) 

Fl0|nx;,n;  /g(T(x)  ,n  0)p(0)d0 


(6.2) 


where  the  denominator  is  a normalizing  constant  used  to  make  the 
posterior  integrate  to  one.  In  a Bayesian  sense,  this  posterior 
is  sufficient  in  that  it  contains  all  of  the  prior  information 
and  all  of  the  information  obtained  from  the  sequential  sample. 

Using  the  posterior  density,  one  can  compute  Bayesian  con- 
fidence intervals  for  the  parameter  0.  For  example,  (0,0) 
where  0 and  0 are  determined  by 


/P(0lT(x)  ,n)d0  = y/2 
-00 

/P (0 | T (x) , n) d0  = y/2 
0 


(6..' ) 


gives  a 100(1-y)%  Bayesian  confidence  interval  for  the  state  of 
nature  0.  An  interval  of  shortest  length  can  be  found  by  fii.ding 

•V 

0 and  0 such  that 


f P (0|T(x),n)d0  = (1-y) 
0 n 


(6.4) 


and  0-0  is  a minimum.  The  expected  value  of  the  posterior 


E (0 ) =/0P  (0 | T (x) ,n) d0 
— <-■>  n 


(6.5) 


can  be  used  as  a point  estimator  for  0.  The  mode  or  median  of 
the  posterior  distribution  can  also  be  used  for  this  purpose. 
Other  percentiles  of  the  posterior  distribution  might  also  be  of 
interest, 

Schmee  (1974)  gives  some  discussion  on  the  choice  of  a prior 
distribution  (often  a point  of  controversy) . Box  and  Tiao  (1973) 
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also  treat  this  problem  at  some  length.  Here,  the  uniform  prior 
is  used,  although  another,  more  appropriate  prior  could  be  speci- 
fied by  the  user  of  these  procedures  if  desired. 

6.3  INTERPRETATION  OF  THE  POSTERIOR  DISTRIBUTION 

In  order  to  interpret  the  posterior  distribution  given  in 
(6.2),  one  should  take  a Bayesian  view  of  the  situation.  To  do 
this,  consider  D,  the  state  of  nature,  to  be  a random  variable. 

It  is  also  necessary  to  specify  a prior  distribution  for  D.  This 
can  be  (and  often  is)  a uniform  distribution  covering  all  possible' 
values  for  the  state  of  nature.  Such  a prior  distribution  is 
called  a "non-informative"  prior.  When  a Bayesian  approach  is 
used,  the  sample  likelihood  and  the  prior  distribution  are  com- 
bined by  using  Bayes'  formula  to  obtain  a posterior  distribution 
expressing  all  of  the  available  information  about  the  state  of 
nature. 

Some  statisticians  would  object  to  this  approach,  claiming 
that  the  parameter  to  be  estimated  is  a fixed  value  and  that 
only  the  sample  from  the  population  is  subject  to  random  fluctu- 
ation. One  who  follows  the  Bayesian  approach,  however,  argues 
that  with  the  information  from  the  sample  and  the  prior  dis- 
tribution which  expresses  the  prior  beliefs  about  the  parameter 
in  question,  a posterior  distribution  can  be  found  which  contains 
all  of  the  available  information  about  the  state  of  nature.  While 
arguments  still  persist  as  to  which  of  these  approaches  is  the 
proper  one,  the  latter  one  will  be  used  here  for  purposes  of 
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estimation.  Some  justification  for  taking  this  approach  follows. 

The  best  justification  for  considering  the  unknown  parameter, 
say  D,  a random  variable,  is  that  in  many  cases,  it  is  in  fact 
random  in  a sense.  Consider,  for  example,  a situation  where  lots 
of  100  transistors  are  to  be  accepted  or  rejected  depending  on 
an  indication  from  a sequential  sample  about  how  many  defectives 
are  in  the  lot.  Even  if  one  is  unwilling  to  consider  the  number 
of  defectives  in  a given  lot  a true  random  variable,  the  posterior 
distribution  can  still  be  used  to  express  the  relative  "degree  of 
belief"  for  different  values  of  the  parameter. 

Besides  the  justification  given  above,  the  Bayesian  pro- 
cedure allows  one  to  easily  obtain  both  point  and  interval  esti- 
mates for  any  parameter.  In  addition,  the  posterior  distribution 
itself  is  available  and  because  this  is  a likelihood  procedure 
and  the  distribution  sample  space  is  countably  finite,  our 
estimates  depend  only  on  the  observed  data  (and  the  assumed  prior) 
and  are  independent  of  any  stopping  rules. 

The  proponents  of  the  classical  approach  to  estimation  do 
not  argue  against  using  Bayes'  theorem  per  se , but  rather  against 
considering  the  unknown  parameter  a random  variable,  and  with 
the  specification  of  a prior  distribution  based  on  subjective 
probability.  The  effect  that  the  prior  distribution  has  on  the 
posterior,  however,  can  be  minimized  by  using  the  "non-informat ive" 
prior  discussed  in  the  last  section. 


1 

] 
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6.4  ESTIMATION  WITH  THE  HYPERGEOMETRIC  DISTRIBUTION 


1. 


i. 


* 


I 


This  section  treats  sequential  estimation  when  sampling  is 
from  a finite  population.  The  estimation  is  to  be  performed 
after  the  completion  of  a sequential  test  of  a hypothesis,  as 
explained  in  Section  6.2.  Johnson  and  Kotz  (19b9)  mention  some 
fixed  size  sample  methods  for  this  distribution.  Also,  a method 
of  finding  confidence  intervals  when  sampling  from  a finite  popu- 
lation is  given  by  Katz  (1971).  Some  of  these  methods  couJd  be 
applied  to  the  situation  here;  however,  the  Bayesian  approach,  as 
explained  above,  is  used  instead. 

The  following  explains  how  the  general  method  outlined  in 
Section  6.2  is  applied  to  sequential  tests  of  the  hypergeometrir 
distribution.  The  probability  mass  function  of  the  hypergeometr i r 
distribution  is 


h (n;N,n,D)  = 


Pi  (N-D) I n!  (N-n)  I 

N!  (D-x) ! xl  (n-x) ! (N-D-n+x) ! 


(6.6) 


This  is  the  probability  of  observing  x defects  in  a fixed  size 
test  with  sample  size  n.  For  a sequential  test,  the  probability 
of  observing  x defectives  at  trial  n is  always  less  than  or  equal 
to  the  probability  obtained  in  (6.6).  This  is  due  to  the  stopping 
rules  of  the  test  and  their  effect  on  the  number  of  "admissible" 
paths  to  a point  in  the  (n,x)  space. 

The  probability  of  reaching  the  point  (n,x)  under  the  sequen- 


tial test  rules  is 


h (x;N,n,D)=K (n,x) 


Pi  (N-D) 1 (N-n) ! 

N!  (D-x) l (N-D-n+x)! 


where  K(n,x)  is  the  number  of  "admissible"  paths  to  point  (n,x), 
as  dictated  by  the  sequential  procedure.  K(n,x)  is  in  general 
quite  difficult  to  determine  directly;  however,  the  direct  method, 
as  explained  in  Chapter  3,  will  allow  one  to  easily  compute  the 
probabilities  in  (6.7). 

For  each  decision  point  (n,x) , (i.e.,  a point  at  which  the 
test  is  terminated  and  a decision  is  made  in  favor  of  one  of  the 
hypotheses)  it  is  desired  to  obtain  the  posterior  distribution 
for  D,  the  number  of  defectives  in  the  population.  Following 
the  procedure  outlined  in  Section  6.2,  a prior  distribution  for 
D is  assumed.  Here,  a uniform  prior  will  be  used  so  that 


P(D)  = 


(N+l) 


0<D<N 


(6.8) 


Of  course,  any  desired  prior  could  be  substituted  for  this. 
Using  (6.2)  and  (6.7), 


P (D  x,n)= 


hs(x;N,n,D)P  (D) 

£h  (x;N,n, j)P  (j) 
J s 

f (x,N,n,D) 

T, f (x,N,n, j) 

3 


(6.9) 


f (x,N,n,D)= 


(N-D) I 

[D-x) ! (N-D-n+x) 1 


and  x is  the  number  of  defectives  observed  at  trial  n. 


The  denominator  in  (6.9)  is  a normalizing  constant  used  to 
force  the  posterior  to  sum  to  one.  The  posterior  in  (6.9)  is 
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independent  of  any  stopping  rules  and  is  easily  evaluated 
directly.  This  gives  the  desired  posterior  probability  mass 
function.  As  explained  in  Section  6.2,  this  distribution  is 
interesting  in  itself,  but  it  is  also  useful  to  find  quantities 
such  as  the  expected  value  or  the  mode  of  the  distribution  and 
confidence  intervals  for  the  number  of  defectives  in  the  popu- 
lation. This  is  done  as  follows. 

The  expected  value  (or  mean)  of  the  posterior  distribution 
is  found  by  using  (6.9)  and  changing  the  integration  in  (6.5) 
to  a summation. 

A 

D = E (D)  = ED-P (D I n ,x)  (6.10) 

D 

which  can  be  shown  (Zacks,  1971)  to  reduce  to 

D = (x+1)  (N+2)/(n+2)-l  (6.11) 

It  should  be  pointed  out  that  the  estimation  procedure 
described  here  is  equally  valid  for  the  k>2  decision  test.  The 
procedure  to  be  used  is  exactly  the  same  as  it  is  for  the  two 
decision  test.  This  is  because  the  estimates  depend  only  on 
the  observed  data  and  not  on  the  particular  stopping  rules  of 
the  test  (the  stopping  rules  do,  however,  dictate  points  in  the 
(n,x)  space  at  which  the  t.est  might  terminate).  In  fact,  the 
same  procedure  is  also  directly  applicable  to  fixed  size  pro- 


cedures. 
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6.5  NUMERICAL  EXAMPLE 

Presented  here,  as  an  example,  are  the  numerical  results 
of  the  estimation  procedure  for  Test  Plan  1 from  Section  4.1. 
Table  6.1  shows  the  posterior  distribution  for  D,  the  expected 
value  of  this  distribution,  confidence  limits  for  D and  the 
actual  confidence  level.  These  are  given  for  each  decision 
point  on  the  boundary  of  the  sequential  test  region. 
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Appendix 

Computer  Programs  Used  to  Develop  and  Evaluate  the  Test  Plans 
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1 
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i. 

f. 

i 

t 

i. 

i 

i. 


lonoc 

1010C<»< 
102UC* 
i 03on* 
104UC* 

1 050C* 

1 06UC* 
1070C* 

1 OBUC* 

1 090C» 
1100C* 

nine... 

112UC 
113U 
114U 
H50 
1160 
1170 
1180 
1190 
1200C 
1210  1 
1220 
1230 
1240 
1250 
1260 
1270 
1 28u  44 
1290 
1300 
1310 
1320 
1330 
1340 
1350  15 
1360 
1370  43 
1380  21 
1390 
1400 
1410 
1420 
1430 
1440 
1450 


ONE  SIDLO  SEOUENT I A L 1EST  OF  THE  HYPERGEOHE TR 1 C DISTRIBUTION 
UILL1AH  0.  HpfcKfcR,  JR 

INSTITUTE  Of  ADH I N I ST  >'A  T I OH  AND  HANaGEHENT 
UNION  COLLEGE 

SCHENECTADY.  NFW  YORK  12308 
JULY  1974 


D 1 HENS  I ON  X(?U0).  Y(20ll) 

OPTION  LOAD 

tOIII  VALENCE  (A.  ALPHA),  (R.RETA) 
fILENAHE  INPUT. I OUT 
LOGICAL  LEV. LDSN, LEST 
IN1EGFR  X , Y 
IHTFOFK  FV.EST.DSN 
DA  1 A INPUT. I0UT/5U, 66/ 

PP IN  I, "INPUTS  N.O0.D1. ALPHA.HFTA.HO" 
INPUTS"" 

|OUTsM" 

HE AIK  INPUT. 21) XN, DO. 01, ALPHA, BETA, XHO 
IF(XN.EO.O.O)  GO  TO  99 
PRINT. "INPUTS  EVaL.OSN.EST" 

HEAOT INPUT. 21) EV, DSN, EST 

T orhat (lull) 

I OUT=" " 

|NPIH  = "“ 

IOO=DD 

101=01 

N=XN 

HR  I TE < I OUT. 15) N, IDO, I D1 , ALPHA , 8E TA 
FORHAT ( ///"1N=  ",  15/"  i.0=  ",I4/"  Dl=  ",14/ 
4 " ALPHAS  "»F6.J/"  riETAs  ",F6.3) 

FORHAT(6F10.U,3ll) 

FORHAT (V) 

LFV=. FALSE. 

LDSNs.FALSF. 

LEST=. FALSE. 

IFIFV.EO.l  )LEV  = . TRUE. 

IF ( DSN.EO. I )LDSN  = . TRUE. 

IFIFST.EO.l  )LFST  = . TRUE. 

1 F ( LEV  ) PR  I NT  11 


12! 


1460  11 
1470 
1480  12 
1490 
1500  14 
1510 
1520 
1530 
1540 
1550 
156U 
1570 
158U 
1590C 
160UC 
1 6 1 0 C 
1620 
16J0C 
164  0C 
1650C 
1660C 
167  IlC 
1680 
1690  10 
1700 
1710  72 
1720 
1730  20 
1740 
1750  73 
1760 
1770  40 
1780  74 
1790  30 
1800C 
181  OC 
18200 
1830 

1840  112 
1850 
1860 

1870  1212 
1800 
1890 
1900 
1910 
1920 
1930 
1940 

1950  789 
1960 


} OHhAT  ( " EVALUATION  (If  REGION  REQUF  STFD"  ) 
ir (tev.AHn.LosNii'PiNi  t? 

f OKMAT  < " DISIRIHUTION  Of  THE  OSN  RFOUFSTLD") 
ir (LEv.ANn.usni'RjMT  h 

FORMAT ( “ ESTIMATION  Of  PARAHETFR  REQUESTED"  ) 

PRINT, "INPUT:  ni.PU.lNC" 

RF  A n ( I NPll 1 , 21 ) l>L , DU,  D ! 

If  (ill.  EO.  0.0)  CO  TO  98 
If (DL.GT.niO  00  ID  98 

I n i = d t 
I n u= ijij*  l . 

I DL=DL*1. 

MO=XHO 

GO  TO  FIND  1 HE  SMALLfST  FIXED  SIZf  TEST 

CALL  FIXSIKN,  I DU  , ID]  .ALPHA. BETA, NEST,  I DL  » I DU,  I D I .* 

MO<0  NATURAL  TRUNCAIION 
Ml)  = 0 f I XFIl  SIZE  TEST 
H0>0  I RUNCA 1 I UN  AT  HO 

IF ( HO  ) 10 , 2D, 4 0 
M0  = N 

PRINT  72 

rORMAK-oTRUNCATE  AT  THE  NATURAL  END  Or  THE  TEST") 
GO  TU  30 
H0=NEST 
PRINT  73 

FORHAT("OTHUNCATE  AT  THE  FIXED  SIZE  SAMPLE") 

GO  TO  30 
PRINT  74, MO 

rORMAT("0 TRUNCATE  AT  TRIAL  ",!5) 

CONTINUE 

FIGURE  RALO  REGION  FOR  DESIRED  SEQUENTIAL  TEST 

CALL  REGION (X,Y, A, R,  1 DU  .■  Ol.N.HO) 

CONTINUE 

PRINT, "INPUT:  REGION  CHANGE" 

HEAD( INPUT. 21  )Xt,XJ,XK 
F0RMATI7F1 0.0  I 
I I = X | 

IF( I I.LE.O)  CO  TO  235 
JJ  = XJ 
KK  = XK 
X(  I I )=JJ 
T(  1 I ) = KK 

HR  ’ TE( 1 OUT ,709)1 1 , JJ.KK 
FORMAT ( " RIGION  CHANGE". 413) 

GO  TO  112 


1970  735 

1980C 

1990C 

?oouc 

2U  10 
2020 
2030  98 
2040  9999 
7050  99 
2060  999 
2070 
2080 
2090 

2100C  

21  IOC 
7120C 
7 1 3 0 C 
7140C 
215UC 

2160C  

2170C 
2180 
2190 
2200 
2210 
2220 
2230 
2240 
2250 
2260C 
2270C 
2200C 
2290 
2300C 
231 UC 
2320C 
2330 
2340 
235uC 
7360C 
2370C 
2380  2840 
2390 
2400 
2410C 
2420C 
2430C 
2440  2640 
245U 
2460C 
2470C 


CONI  I HUE 

GO  10  EVALUATE  RFGIOH  AND  ESTIMATE  PARAMETER  ir  OESIRED 

IF (LEV) CALL  fcVAL{X,Y,L£ST»N,LDSN,HO» 1OL.I0U, 1 01) 

GO  TU  1 
PRINI  9999 

rOKMAK"  •••  INPUT  ERROR  «••") 

PRINT  999 

FORMAT!"  £01)  Of  HUN") 

STOP 

ENP 

SUBROUTINE  FIXSIZ(N,  iDn, 1 01 . ALPHA , HE  I A , Nt S 1 » f PL, I PU. IP!  ) 


THIS  SIIHRUUIINE  OF  T£  RH I NFS  THL  UNRAMPpM 1 7EP  FIXED  SIZE  TEST 
MAVINli  THE  SMALLEST  POSSIBLE  SAMPLF  SI  7b  WHILE  STILL 
MEETING  the  specified  ERROR  PROHAII I l 1 1 y lihits. 


PA  T A I OUT/66/ 

FILENAME  I OUT 

UAT4  IBLNK. ISTR/1H  ,1H*/ 

MA  RK  s I PLNK 
lOUTr"" 

FN=FLNGIN) 

F I XDO=f LNG( IP0)*FLNG(N-iD0)-FN 
F|XP1  = FLNG(  101  )*FING(N-IP1 )-FN 
AM=1. -ALPHA 

FIND  FIRST  FSTIHATE  USING  METHOO  OF  GUENTHER 
NESTsCHEST (N, 100# 1 01  * ALPHA  * SET A ) 

THY  IT 

CALL  TRY (All, PE  I A, 100, I PI , F I XPO , F I XD1 , N, H4 , NEST , I C, ALPHtP, RbTAP  ) 
IF(H4)2640, 2640,2840 

CONDITIONS  MET— HEDUCF  SAMPLE  SIZE 

NEST=NLST-1 

CALL  THY (AH, BET  A, 100, I PI , F I X00 , F I XD1 , N, M4 , NEST, I C, ALPHAP, BE T AP ) 
IF(H4)2840, 2640, 2840 

CONDI T IONS  NOT  MET— INCREASE  SAHPLE  SIZE 
NEST=NFST»1 

IF(NESl-N) 2960,2960, 2680 

METHOO  F A I LS--USE  DEFAULT  VALUFS 
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t 

~± 


2480C 

2490  2680  HARK=|STR 

2500  I C= i no 

2510  NFS  1 =N“ 1 1)1 

2520  GO  TO  2760 

2530  2960  CALL  THY ( A H. BF1 A, 1 00 , 1 1.1 , F I XDO, F I XD1, N,  H4»  NEST , !C, ALPHAP, HFT AP ) 
2540  IF!H4  12760,2640,2760 

2550  2760  Al  l'HAP=l. -ALPHAP 

2560  49  FORMAT!///"  IMF  FIXEn  SIZE  TFSt  IS  AS  F OLLOUS: “ , 1 X , Al /"  SAMPLE  S I 2 
2570  At  = “.15/"  CHI1ICAL  VALUE  = 15/"  ALPHA*  = ",F10.5/ 

2580  A"  BETA*  = ".flO.5) 

2590  WRITE! I OUT, 49 ) HARK, NEST. I C, ALPHAP,  HE TAP 

2600  ICO=IC*l 

2610  WRITE! I OUT , 16 ) 

2620  16  FORMAT!//"  FIXED  SIZE  TEST  OC  FUNCT t OH: "// ) 

2630  WP I TE ( I OUT , 689  ) 

2640  689  FOHHAf I "0  D ACCEPT  HO  ACCEPT  HI") 

2650C 

2660C  CALCULATE  THE  OC  FOR  THE  FIXED  SIZE  TEST 

2670C 

2680  DO  22  I = I DL, I DU, I DI 

2690  | D- I -1 

2700  S1=0. 

2710  110  3.1  ICP1  =1 , ICG 

7720  I C I = ICP1-1 

273(1  33  SI =S1 ♦ I HYPER  ! N,  NEST , 1 0,  1 C T ) 

274  0 S7=1.-S1 

2/50  22  WR|  IF! I0UT.56) ID  ,S1,S7 

2760  56  I ORMATdX,  15, 7f  10.6) 

2770  kFTURN 

7780  END 

2790  Slum  nil  T I NF  RfcGIONIX,  Y,A1,P1,  IDO,  IDl.N.HO) 

7H00C 

7h)  0 C - 

2870C 

?ft30C  Tills  SUBROUTINE  FINOS  THE  RIGIONS  TOR  A UNF  SIDED  TfST  IlF  THE 

284UC  H rpfc  ROE  OME  TRIG  DISTRIBUTION 

2850C 

2 1>  6 U 

2H70C 

2880  LOO  I CAL  XNG, YNG, 0 

7m9ii  INTEOEK  U5 

2900  IHTFGEH  X,Y 

2910  DIMENSION  XIH0)»Y(H0) 

2920C  DATA  IOUT/66/ 

2930  rilEflAHE  TOUT 

2940  10UI="" 

2950  XLO=FLNG!  1 1)1  ) ♦FINO ! N- I 111  ) -!  LHG ! N- 1 1)0  ) -FLNO ! I DO  ) 

2960  WRITE! I OUT , 16 ) 

2970  16  FORMAT!///"  THE  wALU  REGION  IS  AS  FOLLOWS:"/ 

2980  & /"  TRIAL  ACCEPT  HO  ACCEPT  HI") 


1 


■M 


2990 

A=ALOG<  (1.-R1  l/Al) 

* 

__ 

300U 

H=AL0G(fll/U.-Al)> 

1 

1 

3010C 

j 

1 

3 112  0 C 

1 NCREHENT  TRIAL  NUMBER 

i 

c 

3030C 

U5  IS  THE  UPPER  L I H 1 T( ♦ 1 ) ON  THE  NUMBER  OF  DEFECTIVES 

1 

3040C 

L5  IS  THE  LOWER  L!H|T<0>  ON  THE  NUMBER  OF  DEFECTIVES 

I 

uli50C 

| 

1 

30611 

Dtl  22  1 = 1,110 

| 

f 

*- 

3070 

L5=CHAX0(0» I-N«I01-1))»1 

1 

3UB0 

I F 1 I.GJ.l)L5=HAXfl(L5,X<I-l>) 

! 

3090 

U5  = I *? 

1 

3100 

n=. FALSE. 

-■ 

3U0 

1)0  33  K=L5,Ub 

3120 

K2=K-l 

3130 

•F(X2.CE. I-N* ID  1)  00  TO  2380 

I 

3140 

71 =-l.E35 

1 

3150 

GO  TO  2540 

4 • 

3160  2380 

1F(K2.LE. IDO  ) GO  TO  2440 

V' 

3170 

XL=1.E35 

r 

*-  - 

l 

. 3100 

GO  TO  2540 

319UC 

j 

1. 

3200C 

FIGURE  L I KL 1 HUGO  RATIO  AND  PROPER  ACTION 

j 

k 

3210C 

i 

3220  2440 

XI  =XL0*FLNG<N-ID0-I*K2)*FLHG( ID0-K2)-FLNC{ ID1-K2)-FLNG(N-ID1-I«K2) 

i 

| 

3230  2540 

!F(XL.LE.R.0R.K.NE.L5>  GO  TO  2720 

! 

A - 

3240 

X( I )=-l 

i 

k 

3250 

Os.TRHE. 

! 

3260  2720 

IFlO)  GO  TO  2800 

1 

3270 

IF(XL.LE.R)  GO  TO  2800 

1 

L 

t 

32«0 

XI I)=K2-l 

j 

4 

3290 

0=. TRUE. 

| 

f 

3300  2800 

1FIXL.LT. A)  GO  TO  33 

i 

i 

r 

3310 

IFIK7.CT.  I)  GO  TO  2900 

1 

3370 

V 1 1 )=K2 

i 

w '. ' 

» 

3330 

GO  To  2980 

i 

j 

BlS 

3340  33 

CONTINUE 

] 

. 

3350  2900 

Y ( I )=-l 

} 

3360C 

i 

i 

3370C 

PRINT  REGION  UOUNDRY  POINTS 

I 

338IIC 

" 

3390  2980 

XHG=X< | ) .RU.-l 

; 

“ 

3400 

YMG=r< 1 I.EO.-l 

3410 

IF!  1 -MG  ) 173, 37 J » 123 

3470  321 

X( 1 ) = (X< 1 ) * Y ( 1 )>/2. 

3430 

Y(l)sX(|)«t 

3440  123 

1 F < XNG. AND. TNG  ) CO  TO  10 

3450 

IF(XNG)  GO  TO  11 

3460 

IFIYNG)  GO  TO  12 

3470 

WRITE! 1 OUT»41 ) 1, X< 1 ), Y( I > 

3480  41 

FORMAT (IX, I4,4X» 14,7 X# 14) 

r 

3490 

GO  TO  20 

b 


1 


3500  10 
351C  42 
3*20 
3530  11 
3540 
355»  12 
3560  43 
357u  44 
3580C 
3590C 
360DC 
3610  20 
362(1  22 
3630  40 
3640 
3650 
3660C  - 
3670C 
3680C 
3690C 


3700C 

3710C 

3720 

3730 

374il 

3750 

3760 

3770  1 

37fi0 

3790 

3HO0 

3810 

3820  92 

3830 

384  0 

3850 

3860  C 

387UC 

3880C 

3890C 

390tiC 

3910C 

392UC  

3930 

3940 

3950 

3960 

3970 

3980 

3990 

4000 


HR  | Tfc'(  t OUT, 42 ) I 

FORHAKIX,  I4,7X#1H»»J.OX»1HO 

80  TO  20 

MRJTE<IOUT,43>l,YU> 

00  TU  20 

WR|Tfc(|OUT,44)I»X(l) 

FORMAT (IX, |4,7X»1K»,7X» |4) 

FORMAT! IX, 1 4 , 4 X, 1 4 , 1 0 X , 1H«  > 

CHECK  F UR  NATURAL  END  OF  TEST 

IF(  Yd  ) . EO.  X(  1 ) *1 ) CO  TO  40 

CONTINUE 

RETURN 

END 

FUNCTION  CHESTtN, IDO, 101, ALPHA, BETA) 


F I CURE  APROX  SAMP1 E SIZE  USING  METHOD  Of  OUKNTHER  MODIFIED 
KEEKER 


XN=N 
00= I on 
01 =I0T 
XH  = 1 . »BE  T A 
C=0. 

C=C- 1. 

IF(C.CT.01*1 > CO  TO  92 
I L = XNVAL ( XN, 01 ,C, XB ) ♦ . 9999V99 
lw=XNVAL(XN, On, C, ALPHA ) 

IF( TL.UT.  MU  CO  TO  1 

CHESf=IL 

RETURN 

ENO 

SUBROUTINE  TRY  (AM,  BETA,  100,  I 01 , F I XDO,  F I X01 , K4 , NEST,  IC,St,S2) 


TEST  '0  SEE  IF  NEST  IS  A LARGE  ENOIlUH  SAMPLE  SIZE  TO  MEET 
ERROR  LIMITS. 

H4  = l IF  CONO I I I ONS  ARE  MET.  0 OTHERWISE 


f C=FLNC(N- NEST )«FLNC( NEST  ) 

FIXAnn=FC«FlXDO 

FIXAMI=FU*riXDl 

S1=0. 

S2  = 0. 

M4  = 1 

|7=HAX0(0,NEST« I00-N>»1 
|6=H|N0( ioo,nfst)»i 


BY 
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] 

3 

] 

] 


4010 

4020 

4030 

4040 

4050 

4060 

4070 

4060 

4090  22 

4100  3600 

4110 

4120  3540 

4130 

4140 

4150 

4160 

4170 

4180 

4190 

4200 

4210C 

4220C  

4 23  0 C 
424UC 
425UC 
4260C 
4270C 

4280C  

4290C 

4300 

4310 

4320 

4330 

4340 

4350 

4360 

4370 

4380 

4390 

4400 

4410 

4420C 

4430C 

4440C 

44500 

4460 

4470 

4480 

4490C 

4600C 

451UC 


DO  22  1*17.16 
I C= I -1 

S2=S?*EXP<FIXAD1-FLN0< IDl-IC)-FLNG(N-ll)l-N£ST»IC)-FLNG<  IC) 
4-FLNO(NEST-lC)) 

S1=S1*EXP(FIXAD0-FLNG( I DO- 1 C )-FLNG( N- I 00-NFST* I C ) -FLNG C I C > 
i-FLNG(NEST-IC)) 

1F(S2.CT.RETA>  GO  TO  3500 
IF(Sl.GT.AM.AND.S2.L£.RETA>  GO  TO  3540 
CONTINUE 
H4  = 0 


RETURN 
1 C= I -1 
RETURN 
ENG 

FUNCTION  XnVA(.(XN<D»C.PRGB) 

XNVAL=CHIS0(1.2.«C»2.,PR0R> 

XNVAL=(XNVAL»(2.»XN*1.-0*.5*C)*C«(2.»0-C) >/ ( 4 . »D-2 . »C*XNVAL > 

RETURN 

END 

SUBROUTINE  EVAL(X,  T , EST, N, DSN, MO , I DL, I DU. ID  I ) 


THIS  SUBROUTINE  EVALUATES  THE  REGION  FOR  A SEQUENTIAL  TEST  OF 
The  HTPERGEOMETRIC  DISTRIBUTION  (ONF  SIDED  TFST). 

ESTIMATION  OF  THE  PARAMETER  IS  OFFERED  AS  AN  OPTION 


INTEGFR  X.X.O 
REAL  N1.N9.N2 
LOGICAL  NOR 
LOGICAL  IISN. EST 

F 1 1 frame  lour 

DIMENSION  HARM 50  ) 
IMMERSION  IHOLD(bO) 
DIMENSION  X<M0>,Y(M0) 

D l me  ns  I on  A(2nn),B<200 
DIMENSION  PROB (200) 
EOtll  VALENCE  (NHVNT.nP 
FOUI  VALEnCEIXMEAn*  '•  * !» 


) 

) 

MEANU  > 


) 


NPS  IS  THE  MAXIMIIH  NIIHBI  R OF  DFCISION  POINTS 
UMAX  IS  THE  HAXJHUM  POPULATION  SIZF 

DATA  IST6P/9/ 
lOOT*"* 

IF (.NOT. DSN) WRITE! I OUT. 432) 


I NCRENEH 


T THE  DEFECTIVE  NUMBER 


* 
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4920  DO  22  in=IDL,ll»U,IOI 

4930  11=  I [i-l 

4940  |F<nSN)WH|TE<IUUT,4921  10 

4990  4921  FORhaT { ///"  NUMBER  OF  DEFECTIVES  = ",!9/> 

4960  432  FORHaT (94H0  TRUE  0 P(HO)  P{H1>  *SN  VSN  ) 

4970  43  FORMAT ( 46H0  TklAl  P(llfl)  P(H1>  PC  T > P(C)  ) 

49H0  |F<RSN)«R[ TEC IOUI,43> 

4990  A9  = 0. 

4600  H9=0. 

4610  N9  = 0. 

4620  V9=0. 

4630  NP  = 0 

4640  DO  77  1=1, HO 

4690  77  A(l)=n. 

4660C 

4670C  I HCREMENT  THE  TRIAL  NUMBER 

46H0C 

4690  DO  33  1=1, HO 

4700  N1 = I 

4710  13=1*3 

4720  N2=Nl»Nl 

4730  DO  BB  J=1 , HO 

4740  BB  B(J)=fl. 

4790  IF( I-l 129,29,26 

4 7 6 11 C 

4770C  FIGURE  PROBS  TOR  FIRST  STEP 

47B0C 

4790  29  FK?)=FLOAT(D)/I LOAT(N) 

4B00  H(1 1=1 .-B(2) 

4B10  GO  Tu  3940 

48?0C 

4830C  ROUNnHY  POINTS 

4B40C 

4B90  26  LPl=X< I-l ) 

486U  LP2=r(l-t) 

4M.70  LPC  = X ( I 1 

4BU0  L°4=T{ | ) 

4890  IP9=  MAXP (0, LP1 1*1 

4900  IF (LP7.EO.-1 ) GO  TO  2fl 

4910  ?7  LPH  = HIN0U  P2,  I 1*1 

4920  GO  TO  2960 

4 V30  2ft  LP8= I *1 
494  CC 

4990C  HOVfc  PROBS  10  THE  NEXT  STEP 

4960C 

4970  2960  DO  34  J=LP9,LPB 

49B0  O=MAX0( ID-J,0> 

4990  01=M-I*l 

9000  S=0/0l 

9010  B(J»=B( J)*A( J)*(l.-S) 

9020  H( J*1)=B(J*1 )*ACJ)‘S 


J 

J 

*•** 

J 

I 

J 

7j 

D 

< , 

j 


5030  34 

5040 

5050 

5060 

5070 

5000  51 

5090C 

510UC 

5110C 

5120 

5130 

5140 

5150  222 
5160 
5170  52 
5180  53 
5190 
5200 
5210 

5220  3180 

5230C 

5240C 

5250C 

5260  3181 

5270 

5280  333 

5290 

5300  54 

5J10C 

5320C 

5330C 

5340 

5350 

5360 

5370 

5380  356 
5390 

5400  445 

5410  449 

5420  3500 

5430C 

5440C 

5450C 

5460 

5470  3540 
5480  996 
5490  33 
5500  3580 
5510 

5520  229 
5530 


CONTINUE 
P6  = 0. 

P7  = 0. 

IF(LPC.LG.LPl)  00  TO  52 
I F ( LPC+ 1 >51 ,52,51 
LP=H*X0  ( 0. 1. P 1 ♦ 1 )*1 

ACCEPT  POINTS 

LPCl=LPC*l 
DO  22?  J=LP.LPC1 
P6=P6*H< J> 

8 ( J ) =0 . 

A9=A9*P6 

I T ( L P4 ♦ 1 >53,54,5 3 
L1=LP4*1 

ir<LP4.GT.LP2.AND.LP2.GE.l > GO  TO  54 
L?  = Ll 

IF (LP4-LP2 >3180.3181,3181 
L?=LPP*1 

REJECT  POINTS 

00  333  JsLl.l? 

P7=P7*R< J> 

U<  J>  = 0. 

R9=R9»P7 

P8=P/*P6 

ACCUMULATE  FXPFCTED  VALUES 


N9=N9*P8»N1 

V9=V9*P8»N2 

I F < .MOT.OSN)  GU  TO  3500 
IF(N9)35o, 3540, 356 
P10=l.-R9-A9 

WK  I TE ( I OUT, 4 45  ) I ,P6, P7,P8,P10 
FORMAT (IX, I5.4F10.5) 

F OKMA T ( 3 X , 20  I fi ) 

I F ( LP1  .E0.-1.0R.LP4.E0.-1>  GO  TO  3540 

CHECK  FOR  .NATURAL  END  Of  TEST 

I F ( LP4 -LPC . Lfc . 1 > GU  TO  3580 
00  996  J = 1 . 13 
A( J>=H( J) 

continue 

V9=V9-N9«N9 
HO  = I 

continue 

I F ( .NOT. DSN)  GO  10  36811 


5540 

5550  3600 

5560  401 

5570  22 

5080 

5590 

5600 

5610 

5620 

5630 

5640  4212 

5650 

5660 

5670 

5680C 

569UC 

5700C 

5710  4213 

5720 

5730  16 

5740 

5750 

5760 

5770  17 

5780 

579U  1 

5800 

5810 

5020 

5030  426 

5840 

5b5(j 

5860  215 
5870 

5«8n  1316 

5890 

5900 

5910  1319 
5920 

5 9 3 0 216 
5940  99 
5950 

5940C 

5970C 

5980C 

5990G 

6 0 0 0 C 

6010C 

6020 

6030 

61140 


WRI TEII0UT.432) 

WRI TEC  IOUT,401  ) D,A9,R9,N9,V9 
FORH4TI1X,  I6,2F12.6,2F12.4  ) 

CONTINUE 

1FI.NOT.EST)  BO  TO  99 
DO  216  N1=2,H0 
KTO=l 

|F< X!NI).L£.X!NT-1  ) > CO  TO  4212 
NO=  X ! NT  ) 

00  TO  4213 

I F ! Y ( NT ) . FQ«  *1  ) GO  TO  216 

|F!Y!NT).GT.Y!NT-i:.AND.Y<NT-l).NF.-l>  60  TO  216 
ND  = Y (NT  ) 

KT0  = ? 

r I rill  CONFIDENCE  INTERVAL 

CALL  Cl  (XM,  I III,  I DU,  I L#  ltl»PROB*PP»NT»ND,N) 

MR  1 T£ { 1 OllT , 1 6 I NO, NT  » XH»  II.,  1U,PP 
FORHAT1//1X,  14,"  DLFECT IVES  AT  TRIAL  *,14, 

S " ESTIMATE:  " » F7 . 3/ " LOWER  LIM 

S I T=  -.14."  UPPER  LIMIT=  ",I4/«  CONFIDENCE  LFVEL*  *,F6.4) 
HR  I TEC  I OUT, 17) 

FORMAT!  /"  POSTERIOR  DISTRIBUTION  OF  D"/> 

I S = 1 • 1 STI  P 
ISs  IS* I STEP 

1 F=  I S* I S I FP-1 
IE=H|N0( IE, intn 
UO  426  |K=IS. IE 
IIIOLlX  IK)*IK-1 

HR  I TF( I OUT, 449 H I HOLD! I >, l = IS* IE) 

WRITE! I OUT , 215 ) ( PROR ! 1 ) , l = IS, IE) 

FOKMAT!6X,20F6,3) 

WRI TM 10UT, 1316) 

FORMAT ! Ill  ) 

If  ! IF.NF. IDU)  GO  TO  1 
HR!  re  < I OUT, 1319) 

FORMAT !lX,H0!tH— ) ) 

IF  ! K TO.EO. 1 ) GU  TO  4212 

CONTINUE 

HE  1 'IHN 

END 


THIS  SUBROUTINE  FINDS  THE  SHALLEST  CONFIDENCE  INTERVAL 
FOR  THE  TRUF  NUMBER  UF  DEFECTIVES 


S1HIROHT  INE  CKXHEAN,  I Dl  , IP!1,  IL,  I U,  PROR , P,  N T , ND,  N ) 
DIMENSION  PROB ( 200  ) 

DA  I A CIL/.17 


*4 


6050  P=0. 

6(160  XMMN=0. 

6070  I 0U=N-NT*ND*1 

6080  10L  = M)*1 

61190  I«1  999  1=1  , I Ul 

6100  999  PPUR ( I ) = 0 , 0 

6110  PSMH=0.0 

6120  00  77  I = 1 0 L « 1 DU 

6130  J=J-1 

6140  PPOP ( I )=EXP<FLNC(J>*FLNG{N-J)-rLNG<  J-NO ) -FLNG < N- J-NT»ND  ) ) 

6150  77  PSUM=PSUH*PR(1P(  I ) 

6160  00  88  I = I DL  » 1PU 

6170  88  PROBtt  >=PR<JB<I  J/PSUH 

6180  no  22  1 = i nt»  mu 

6190  22  XPf AN=XHfcAN*PR0B(l >«FLOAT< 1-1 > 

6200  1 11=  I no 

6210  11=1 

6220  1 ir(PHOB( 11 I.LT.PROBCIUI)  GO  TO  2 

6230  PH  = P*PROll<  IU) 

6240  IT (PH.GT.CIL)  GO  TO  99 

6250  P=PH 

6260  IW=H»-1 

6270  GO  TO  1 

6280  2 PH  = P*PICOH<  I L > 

6290  iriPH.CT.CII.)  GO  TO  99 

6300  P=PH 

6310  11=11*1 

6320  GO  TO  1 

6330  99  P=l.-P 

6340  10=10-1 

6350  11=11-1 

6360  KFIIIHN 

6370  eon 

6380C 

6J90C  FUNCTION  TO  RfcTORN  1HI  NATURAL  LOG  FACTORIAL 

6400  HWCriON  F L N G ( J ) 

6410  li  | HF NS  I ON  F ( 105  ) 

6420  li  A T A HAR6/1/ 

6430  IF (MARX >20,20,21 

6440  20  F l KG  = F ( J* 1 ) 

6450  HFIIIKN 

6460  21  F ( 1 ) = 0 . 

6470  F'2)=0. 

6480  00  72  1=3,103 

6490  F < I ) =F  ( I-l»*ALOGCFLOAT(l-l)) 

6500  2?  CONTINUE 

6510  HARK=0 

6520  GO  TCI  20 

6530  END 

654QC 


1000 
1010 
1020 
1030 
10*0 
1050 
1060 
1070 
1080 
1090 
1100 
1110 
1120 
1130 
1140 
1150 
1160 
1170 
1180 
1190 
1200 
1230 
1240 
1250 
1260 
1270 
1280 
1290 
1300 
1310 
1320 
1330 
1340 
1350 
1360 
1370 
1380 
1390 
1400 
1 4 1 0 
1420 
1430 
1440 
1450 


REM 

REM#**# »«<>*•#•#•*•*«•••• • «•»•#*«•#*****•»  **«>->*«  *#•*»***»*»»•» 

PEM# 

PEMs 
PEM# 

REM# 

REM# 

PEM# 

REM# 

REM# 

REM* 

REM# 

REM* 

PEM# 

REM# 

REM# 

REM# 

REM# 

REM* 

REfi#»»»»« »*«»« »•#•**###*#«*»**•««#* ft***##**#######*#*********** 

REM 

OEF  FN2(X)  = INT (X#l 00000, *.5J / 100000. 

DIM  A S f 3 J 

S 0F  ‘ c,,eN  ""•'•‘“(werRIC  TEST  REGION" 

read  DO.Dl fD5,N 
PRINT 

PILES  regon> 

DIM  21101.41 
READ  #1,1 

PEM 

PEM  READ  PFGION  FROM  FILE~#l 

MAT  READ  #1 ;2 
LET  T7r 1 

POP  1=1  To  100 

LET  XII  ) = 7(I,11 
LET  YC I 1=7| 1*21 
LET  UI I J = /( 1.31 
LET  VI  I l=7( I ,4) 

NEXT  I 

LET  Mls2(101»ll 
LET  M2=2 ( 101 .3  J 
LET  MO  = Ml  MAX  M2 


THIS  PROGRAM  WILL  EVALUATE  A THREE  DECISION 
SEoUENTlAt  TEST  REGION.  THE  REGION  MAY  BE  TRUNCATED. 
THE  TEST  REGION  IS  TO  BE  READ  FROM  FILE 
REr.O‘i>.  THE  ASN  ANO  OC  FUNCTIONS  ARE  PROVIDED.  THE 
DISTRIBUTION  OF  THE  DECISIVE  SAMPLE  NUMBER  MAY  BE 
PRINTED  AS  AN  OPTION. 


WILLIAM  0.  MEEKER*  JR. 
INSTITUTE  OF  ADMINISTRATION 
AND  MANAGEMENT 
UNION  COLLEGE 

SCHENECTADY*  NEW  YORK  12308 
JULY  1974 


1460  PRINT  «N=»{N 
1470  PRINT 
1480  LET  01=0 

1490  PRINT  "00  YOU  WANT  TO  SEE  THE  DSN"{ 

1500  INPUT  AS 

1510  IF  AS  <>  "YES"  THEN  1720 

1520  LET  01=1 

1530  GOTO  1840 

15*0  PRINT 

1550  PRINT 

1560  PRINT  "TRUE"! TAB (13) ; "PROS" 5 TAB (26) {"PROB" 1 
1570  PRINT  TAB(38) !"PR0B" 

1580  PRINT  " D"!TAB(IH! "ACCEPT  H1"!TAB«241  {"ACCEPT  HO»t 
1590  PRINT  TABt36) {"ACCEPT  H2"{TABf53) !"ASN»{TAB(65) {"VSN" 

1600  REM 

1610  REM  SPECITY  TRUE  DEFECTIVES 

1620  REM 

1630  FOR  0=00  TO  01  STEP  05 

1640  LET  A9=R9=B9=V9=N9=0 

1650  MAT  A=ZER 

1660  IF  01=0  ThEN  2140 

1670  PRINT 

1680  PRINT 

1690  PRINT  "NUMBER  OF  0EFECTIVES=" {0 

1700  PRINT  "T° I AL" {TAB  f 13) { "PROB" ! TAB (26) ! "PROS** 

1710  PRINT  TAB(3fl) ! "PPOU" { TAB (52 J ! "TOTAL" ! TAB (64) {"PROB" 

1720  PRINT  "NUMfaER"{TAn(ll) {"ACCEPT  HI "{ TAB (24 ){ "ACCEPT  H0"{ 
1730  PRINT  TAB{36) {“ACCEPT  H2" { TAB ( 52) ! "PROB" { 

1740  PRINT  TAfi(62) {"CONTINUE" 

1750  REM 

1760  PEK  SPCCIFY  TRIAL  NUMBER 

1770  REM 

1780  POR  Nl=l  TO  MO 
1790  LET  N2=Nl»Nl 
1800  MAT  B=ZER 
1910  IF  N1>1  Then  2280 
1820  LET  Bl 2 J=D/N 
1830  LET  B( 1 I = X — B f 2 ) 

18*0  GOTO  3880 
1850  LET  PI =XI Ni-i j 
1860  LET  P4=V( Nl-1  ] 

1870  LET  P0=0  MAX  PI 
1880  IF  Pl=-1  Then  2400 
1890  LET  ”8  = P4  HIN  N1 
1900  GOTO  2480 
1910  LET  P0=N1 

1920  REM 

1930  REM  FIGURE  PROBABILITIES  AT  THE  NEXT  STEP 

1940  REM 

1950  FOR  J=P9  TO  P8 

I960  LET  $=(0  MAX  (D-J) ) / (N-Nl ♦ 1 ) 


1970  LET  BI J»1  I=BI J*1 WAf J*1 1#(1-5J 
1980  LET  B(  J*2  |=B{  J*2  UAf  J*1  ]#S 
1990  NEXT  J 

2000  REM 

2010  PEM  ALLOW  PRINTING  OF  INOlVlDUAL  PROBABILITIES 

2020  REM 

2030  LET  A$=»Nn 

2090  IF  AS  <>  "Y"  THEN  2820 

2050  FOR  IrO  To  Nl+2 

2060  IF  B( I ♦ 1 ]:0  THEN  2790 

2070  PRINT  Nl * 1 161 1*11 

2080  NEXT  I 

2090  REM 

2100  REM  FIGURE  PROBABILITIES  OF  TERMINATION 

2110  REM 

2120  LET  P3=u(Nl  ] 

2130  LET  P9=vlNll 
2190  LET  Pl=xlNll 
2150  LET  P2=Yl Nl  I 
2160  LET  P5=P6=P8=0 

2170  IF  P3 <P2  OR  P3=-l  OR  P2=-l  THEN  3190 

2180  REM 

2190  REM  FIGURE  PROBABILITIES  FOR  ACCEPT 

2200  REM .... 

2210  FOR  J=P2  JO  P3 
2220  IF  T7=l  ThEN  3060 

2230  print  "y";j;ei j*i i 

2290  LET  P6=P6*BIJ*11 
2250  LET  BIJ*ll=0 
2260  NEXT  J 
2270  LET  B9=B9«P6 
2280  IF  Pl=-l  THEN  3360 

2290  REM 

2300  REM  FJGUPF  ProBABILITIES  FOR  REJL 

2310  REM 

2320  FOR  J=(0  MAX  XfU-ll)  TO  PI 

2330  IF  T7=l  ThEN  3280 

2390  PRINT  "X-SJSEf J*1 1 

2350  LET  P5=P5*8fJ*lJ 

2360  LET  BlJ*ll=0 

2370  NEXT  J 

2380  LET  A9=A9*P5 

2390  IF  P«=_l  THEN  3690 

2900  LET  Ll=L2=P9 

2910  IF  Pt,  >=  Y(  Nl-1  ) THEN  3500 

2920  LET  L2=YI Nl-1  I 

2930  REM 

2990  REM  FIGURE  PROBABILITIES  FOR  REJU 

2950  REM 

2960  FOR  J=L 1 TO  L2 
2970  IF  T7=l  ThEN  3560 


142 


print 

LET  P8=PfUBIJ»l1 
LET  Bt  J*  1 1=0 
NEXT  J 
LET  R9~.R9.P8 
LET  P9=P5.P6*P£i 
LET  V9= V9*N2»P9 
LET  N9~N9.N1«P9 
IF  01=0  ThEN  3840 
IF  N9=0  Then  3880 

REM 

REM  PRINT  DSN  IF  DESIRED 

REM - 

PRINT  NUTABdl)  SFN2(P5)  ;TAB(29J  !FNZ<P6>  ;TAB<36>  JFNZIP8)  I 
PRINT  TAB(SO) ;FN7(P9) ; TAB  £62 } SFNZ ! I-R9-A9-B9) 

IF  PI=-l  OR  P2=-l  OR  P3=-l  OR  P4=-l  THEN  3880 

IF  P2-P1  <=  1 AND  P4-P3  <=  1 THEN  3920 

MAT  A=B 
NEXT  N1 

LET  V9=V9-N9.2 
IF  01=0  Then  4080 
PRINT 
PRINT 

PRINT  «TRuE«;TABtl3) 5"PROB";TAB(26>  5"PR0B"I 
PRINT  TAB(3e» ;"PPOB" 

PRINT  » C" ;TA6 1 J I ) • "ACCEPT  H1";TAB{2A) ; "ACCEPT  HO"I 
PRINT  TAet36) ;"ACCEPT  H2« ; TAB ( 53 ) 5 " ASN" ; TAB (65 > ; "VSN" 
PRINT  D;TaB(11) :FNZ(A9J I TAB (24) ;FNZ<B9) I 
PRINT  TAB  (36)  ;FNZ(R9J  STAB.'SO)  S FNZ <N9) ? TAB { 62 ) 5FNZIV9J 
NEXT  D 

DATA  20.60.4,100 
END 


